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Preface 



Of making many bookes there is no 
end, and much studie is a weari- 
nesse of the flesh. 

Ecclesiastes XII, 12. 

When I first took an interest in the Geometry of Numbers, I was 
struck by the absence of any book which gave the essential skeleton 
of the subject as it was known to the experienced workers in the subject. 
Since then the subject has developed, as will be clear from the dates 
of the papers cited in the bibliography, but the need for a book remains. 
This is an attempt to fill the gap. It aspires to acquaint the reader with 
the main lines of development, so that he may with ease and pleasure 
follow up the things which interest him in the periodical literature. 
I have attempted to make the account as self-contained as possible. 

References are usually given to the more recent papers dealing with 
a particular topic, or to those with a good bibliography. They are given 
only to enable the reader to amplify the account in the text and are 
not intended to give a historical picture. To give anything like a reason- 
able account of the history of the subject would have involved much 
additional research. 

I owe a particular debt of gratitude to Professor L. J. Mordell, 
who first introduced me to the Geometry of Numbers. 

The proof-sheets have been read by Professors K. Mahler, L.J. 
Mordell and C. A. Rogers. It is a pleasure to acknowledge their 
valuable help and advice both in detecting errors and obscurities and 
in suggesting improvements. Dr. V. Ennola has drawn my attention to 
several slips which survived info the second proofs. 

I should also like to take the opportunity to thank Professor F. K. 
Schmidt and the Springer- Verlag for accepting this book for their 
celebrated yellow series and the Springer- Verlag for its readiness to 
meet my typographical whims. 

Cambridge, June, 1959 



J. W. S. Cassels 
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Notation 

An effort has been made to distinguish different types of mathemati- 
cal object by the use of different alphabets. It is not necessary to 
describe the scheme in full since an acquaintance with it is not pre- 
supposed. However the following conventions are made throughout the 
book without explicit mention. 

Bold Latin letters (large and small) always denote vectors. The 
dimensions is w, unless the contrary is explicitly stated ; and the letter n 
is not used otherwise, except in one or two places where there can be 
no fear of ambiguity. The co-ordinates of a vector are denoted by the 
corresponding italic letter with a suffix 1, 2, ..., «. If the bold letter 
denoting the vector already has a suffix, then that is put after the 
co-ordinate suffix. Thus; 

a = («i, •••,«») 

x; = (z(„...,z;,). 

The origin is always denoted by o. The length of x is 
|a5| = (xi-l h^)^- 

Sanserif Greek capitals, in particular A, M, N, F, denote lattices. 

The notation d{A), A(S^), V(S^) for respectively the determinant 
of the lattice A and for the lattice-constant and volilme of a set 
will be standard, once the corresponding concepts have been introduced. 

Chapters are divided into sections with titles. These sections are 
subdivided, for convenience, into subsections, which are indicated by a 
decimal notation. The numbering of displayed formulae starts afresh 
in each subsection. The prologue is just subdivided into sections without 
titles, and it was convenient to number the displayed formulae con- 
secutively throughout. 




Prologue 

Pi. We owe to Minkowski the fertile observation that certain 
results which can be made almost intuitive by the consideration of 
figures in «-dimensional euclidean space have far-reaching consequences 
in diverse branches of number theory. For example, he simplified the 
theory of units in algebraic number fields and both simplified and 
extended the theory of the approximation of irrational numbers by 
rational ones (Diophantine Approximation). This new branch of 
number theory, which Minkowski christened “The Geometry of Num- 
bers”, has developed into an independent branch of number-theory 
which, indeed, has many applications elsewhere but which is well worth 
studying for its own sake. 

In this prologue we first discuss some of the concepts and results 
which will play a leading role. The arguments we shall use are some- 
times rather different from those in the main body of the text: since 
here we wish to make the geometrical situation intuitive in simple cases 
without necessarily giving complete proofs, while later we may need to 
sacrifice picturesqueness for precision. The proofs in the text are inde- 
pendent of this prologue, which may be omitted if desired. 

P2. A fundamental and typical problem in the geometry of numbers 
is as follows: 

Let f(xi,...,x„) be a real- valued function of the real variables 
x^, ..., x„. How small can | /(mj, . . . , «„)| be made by suitable choice 
of the integers It may well be that one has trivially 

/(O, . . . , 0) =0, for example when f(x-^, x„) is a homogeneous form; 

and then one excludes the set of values Mj = = ■ • ■ = = 0. (The 

“homogeneous problem”.) 

In general one requires estimates which are valid not merely for 
individual functions / but for whole classes of functions. Thus a typical 
result is that if 

f{xi,x^) =a^-^x\ + + ('•) 

is a positive definite quadratic form, then there are integers not 

both 0 such that 

f{u„u,)^{ADI})K (2) 

where 

D = ^ 11^22 ^12 
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is the discriminant of the form. It is trivial that if the result is true 
then it is the best possible of its kind, since 

^ I 

for all pairs of integers % , Mj not both zero ; and here D = | . 

Of course the positive definite binary quadratic forms are a par- 
ticularly simple case. The result above was known well before the birth 
of the Geometry of Numbers; and indeed we shall give a proof sub- 
stantially independent of the Geometry of Numbers in Chapter II, §3. 
But positive definite binary quadratic forms display a number of argu- 
ments in a particularly simple way so we shall continue to use them as 
examples. 

P3. The result just stated could be represented graphically. An 
inequality of the type 

where /(%, is given by (1) and k is some positive number, represents 
the region 01 bounded by an ellipse in the {x^, aCj) - plane. Thus our 
result above states that 0t contains a point {u^ , u^) , other than the 
origin, with integer coordinates provided that A^(4£>/3)^- 

A result of this kind but not so precise follows at once from a 
fundamental theorem of Minkowski. The 2-dimensional case of this 
states that a region ^ always contains a point (% , u^) with integral 
co-ordinates other than the origin provided that it satisfies the following 
three conditions. 

(i) ^ is symmetric about the origin, that is if (xj, Xg) is in ^ then so 
is (-Xi, —x^). 

(ii) is convex, that is if (x ^ , .Vj) and (yj , y^) are two points of ^ 
then the whole line segment 

{Axi-b(l-A)yi, Axj-f (1 - A)y2} (O^A^l) 

joining them is also in 01. 

(iii) ^ has area greater than 4. 

Any ellipse /(xj, x^-^k satisfies (i) and (ii). Since its area is 

kn kn 

(^11 ®22 “ 

it also satisfies (iii), provided that kn>AD^. We thus have a result 
similar to (2), except that the constant (f)i is replaced by any number 
greater than Ajn. 

P4. It is useful to consider briefly the basic ideas behind the proof 
of Minkowski’s theorem, since in the formal proofs in Chapter 3 they 
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may be obscured by the need to obtain powerful theorems which are 
as widely applicable as possible. Instead of the region M, Minkowski 
works with the region of points where (a:^, X 2 ) is 

in Thus is symmetric about the origin and convex; its area is J 
that of ^ and so is greater than 1. More generally, Minkowski considers 
the set of bodies similar and similarly situated to ^ but 

with centres at the points (mj , Mg) with integer co-ordinates. 

We note first that if and ^’(mj, Mg) overlap then^ (mj, Mg) is in 
For let a point of overlap be (Ii,l 2 )- Since (^ 1 , 12 ) is in 6^{ui,u^ 
the point (li— «i, l 2 ~^ 2 ) must 
be in ■9'. Hence, by the symmetry 
of -9, the point {m^— M g— |g) is 
in 9. Finally, the mid-point of 
(Mj — li, Mg-fg) and (^ 1 , fg) is in 
because of convexity, that is 
(i , 2 W 2 ) is in 6^, and (mj , Mg) 
is in 9t, as required. It is clear 
that 5 ^(Mj, Mg) overlaps 9'{u'i, Mg) 
when and only when £9 overlaps 
■5^(Mj — Ml, Mg— Mg). 

To prove Minkowski’s theorem, 
it is thus enough to show that when 
the 9 (mj , M 2 ) do not overlap then 
the area of each is at most f. A 
little reflection convinces one that 
is given in Chapter 3- Another argument, which is perhaps more intuitive 
is as follows, where we suppose that 9^ is entirely contained in a square 

Ixgl^X. 

Let [7 be a large integer. There are {2V -\-\Y regions 9{ui, Mg) whose 
centres (Mi,Mg) satisfy 

|mi 1^[7, iMgl^’t/. 

These 9 (Mj , Mg) are all entirely contained in the square 

\x^\^U+X, \x2\^U+X 

of area 

A{U+XY. 

Since the 9 (mj , Mg) are supposed not to overlap, we have 

[2U + \YV^A{U ^X)\ 

1 The converse statement is trivially true. If (mj, Wg) is in M then (jMj.lMg) 
is in both 9 and 9’(u^,u^. 




this must be so. A formal proof 
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where V is the area of ; and so of each (u^, u^). On letting U tend 
to infinity we have F ^ 1 , as required. 



P5. A change in the co-ordinate system in our example of a definite 
binary quadratic form / (xj , x^) leads to another point of view. We may 
represent /(Xj, as the sum of the squares of two linear forms: 

f[x,.x,)=X\^-X\, (3) 

where 

= a Xj -f Xj , Xj = y -h <3 Xa (4) 



and a, y, b are constants, e.g. by putting 




We now consider as a system of rectangular cartesian co- 

ordinates. The points Xj , Xj corresponding to integers Xj , x^ in (4) are 
then said to form a (2-dimensional) lattice A. In vector notation A is 
the set of points 

(Xi,Xa) =«i(a,y) -b«a(^,(3), (7) 



where u^, Mj run through all integer values. 

We must now examine the properties of lattices more closely. Since 
we consider A merely as a set of points, it can be expressed in terms 
of more than one basis. For example 

(«-^, y-6), (-/3, -<3) 

is another basis for A. A fixed basis (a, /3) , (y, 6) for A determines 
a subdivision of the plane by two families of equidistant parallel 
lines, the first family consisting of those points (Xj, Xj) which can be 
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expressed in the form (7) with integral and Mj only real, while for 
the lines of the second family the roles of and Mj ^re interchanged. 
In this way the plane is subdivided into parallelograms whose vertices 
are just the points of A. Of course the subdivision into parallelograms 
depends on the choice of basis, but we show that the area of each 
parallelogram, namely 

|a(5 — /5y| , 

is independent of the particular basis. We can do this by showing that 
the number N{X) of points of A in a large square 



satisfies 



J(X): \X^\<X 



-v(-y) y 

4A-2 |a(5-/3y| 



(Z- 



Indeed a consideration along the lines of the proof of Minkowski’s 
convex body theorem sketched above shows that the number of points 
of A in .2 (X) is roughly equal to the number of parallelograms contained 
in j2 [X) , which again is roughly equal to the area of ^ {X) divided by 
the area \ccd—^y\ of an individual parallelogram. The strictly positive 
number 

d{A)=\oLd-^y\ ( 8 ) 



is called the determinant of A. As we have seen, it is independent of 
the choice of basis. 



P6. In terms of the new concepts we see that the statement that 
there is always an integer solution of /(%j, X 2 )^{ 4 DI})^ is equivalent 
to the statement that every lattice A has a point, other than the origin, in 

Z? + Zl^(|)l^f(A). (9) 

On grounds of homogeneity this is again equivalent to the statement 
that the open circular disc 

Z?+Z|<1 (10) 

contains a point of every lattice A with (i(A)<(|)^ and the fact that 
there are forms such that equality is necessary in (2) is equivalent to 
the existence of a lattice \ with determinant i(AJ =(|)^ having no 
point in So our problem about all definite binary quadratic forms 
is equivalent to one about the single region ^ and all lattices. Similarly 
consideration of the lattices with points in 
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gives us information about the minima of indefinite binary quadratic 
forms; 

inf I/(wi,M2)|: 

«i ,«2 integers 
not both 0 

and so on. 

These considerations prompt the following definitions. A lattice A 
is said to be admissible for a region (point-set) 0 t in the Z2)-plane 
if it contains no point of 01 other than perhaps the origin, if that is a 
point of We may say then that A is ^-admissible. The lower bound 
A {^) of d (A) over all ^-admissible lattices is the lattice-constant of : 
if there are no ^-admissible lattices we put A {R) = oo. Then any 
lattice A with d[N)<A [R) certainly contains a point of ^ other than 
the origin. An ^-admissible lattice A with d{N)=A {^) is called critical 
(for Si ) : of course critical lattices need not exist in general. 

The importance of critical lattices was already recognized by 
Minkowski. If \ is critical for ^ and A is obtained from \ by a 
slight distortion (i.e. by making small changes in a pair of base-points) 
then either A has a point in ^ other than the origin or d{/\)^d{\) 
(or both). 

As an example, let us again consider the open circular disc 

X\+X\<\. 

Suppose that \ is a critical lattice for We outline a proof that a 
critical lattice, if it exists, must have three pairs of points ±(Aj,T2), 
±(Q.C2) on the boundary X\-\-X\ = \ of S’. For if A,, 
had no points on Af -f A| = 1 , we could obtain an ^-admissible lattice 
with smaller determinant from A^ by shrinking it about the origin, that 
is by considering the lattice /\ = t\ of points {tX^.tX^), where 
(Aj,A 2)6A and 0<^<! is fixed. Then d(/\) =t^d{A.)<d{A^), and 
clearly A would be also .^-admissible if t is near enough to 1 . Hence A,, 
contains a pair of points on Af -f A| = 1 , which, after a suitable rotation 
of the co-ordinate system, we may suppose to be ±(t, 0). If there were 
no further points of A^ on Af A| = 1 then we could obtain a 2 - 
admissible lattice A of smaller determinant by shrinking A^ perpendicular 
to the Aj-axis, that is by taking A to be the lattice of (Ai,/A 2), 
(Aj , A2) 6 A ^ , where t is near enough to 1 . Finally, if A^ had only two 
pairs of points i(1,0), Az(Bi, B2) on the boundary, then it is not 
difficult to see that it could be slightly distorted so that (1, 0) remains 
fixed but (Bj , B2) moves along A? -f A| = t nearer to the A^-axis, cf . 

Fig- 3 - 

This can be verified to decrease the determinant of the lattice 
[indeed (1, 0) and (Bj, B^) can be shown to be a basis for AJ, and for 
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small distortions the distorted lattice A will still be ^-admissible. 
Hence a critical lattice (if it exists) must have three pairs of points 
on Zj -f-Zj = 1 : and it is easy to verify that the only lattice with three 
pairs of points on Xl+Xl = \, one of them being ±(1, 0), is the lattice 
A' with basis 

. 0) . (i , 1/f ) • 

This has the vertices of / \ 

a regular hexagon / \ 

±( 1 , 0 ), / \ 

±(i.yf). I 0(0,0} 

±(-iyi) ^ y 

onXi-fZ 2 = 1, but no \ / 

points in X\+X\<\. \ / 

We have thus shown N. 

thatZl(£»)=(f(A')=(|)4 

provided that ^ has 

a critical lattice. Min- p,g 3 

KOWSKi showed that 



critical lattices exist for a fairly wide set of regions ^ by, roughly speak- 
ing, showing that any ^-admissible lattice A can be gradually shrunk 
and distorted until it becomes critical. In the text we give a more 
general proof of the existence of critical lattices using concepts due to 
Mahler which turn out to have much wider significance. 



P/. Another general type of problem is the typical "inhomogeneous 
problem”: Let f[x^, x„) be some real-valued function of the real 

variables x^, x„. It is required to find a constant k with the following 

property : If are any real numbers there are integers Mj , . . . , 

such that 

l/(fl-Mi,...,f„-M„)| gA:. 

Questions of this sort turn up naturally, for example in the theory 
of algebraic numbers. Again there is a simple geometric picture. For 
simplicity let w = 2. Let ^ be the set of points (% , x^) in the 2-dimen- 
sional euclidean plane with 

Denote by ^(mj, m^), where %, «2 are any integers, the region similar 
to ^ but with the displacement that is ^(Wj, Mj) is the set of 

points , x^ such that 

I / (% ^1 , ^2 *< 2 ) I ^ ^ . 
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Then the inhomogeneous problem is clearly to choose k so that the 
regions ^ (mj , cover the whole plane. In general one will wish to 
choose k, and so 0t, as small as possible so that it still has this covering 
property. Here we have a contrast with the treatment of the homo- 
geneous problem in § 4, where the objective was to make the regions 
[there denoted by y (m, v)] as large as possible but so that they did not 
overlap. 

In this book we shall mainly be concerned at first with the homo- 
geneous problem. Only when we have a fairly complete theory of the 
homogeneous problem will we discuss in Chapter XI the inhomogeneous 
problem and its relation to the homogeneous one. 




Chapter I 



Lattices 



1.1. Introduction. In this chapter we introduce the most important 
concept in the geometry of numbers, that of a lattice, and develop some 
of its basic properties. The contents of this chapter, except § 2.4 and 
§ 5, are fundamental for almost everything that follows. 

In this book we shall be concerned only with lattices over the ring 
of rational integers. A certain amount of work has been done on 
lattices over complex quadratic fields, see e.g. Mullender (1945 a) and 
K. Rogers (1955a). Many of the concepts should carry over practically 
unaltered. Again, work on approximation to complex numbers by 
integers of a complex quadratic field [e.g. Mullender (1945 a), Cassels, 
Ledermann and Mahler (1951a), Poitou (1953 a)] and on the minima 
of hermitian forms when the variables are integers in a quadratic field 
[e.g. Oppenheim (1932a, 1936a, 1953 f) and K. Rogers (1956a)] may 
be regarded as a generalization of the geometry of numbers to lattices 
over complex quadratic fields. We shall not have occasion to mention 
lattices over complex quadratic fields again in this book; we mention 
them here only for completeness. For lattices over general algebraic 
number fields see Rogers and Swinnerton-Dyer (1958 a). 

1.2. Bases and sublattices. Let Cj, ..., a„ be linearly independent 

real vectors in n-dimensional real euclidean space, so that the only set of 

numbers q,...,f„ for which -{-t„a^ = o is = ■■ ■ =t,^ = 0. 

The set of all points , , 

+ +«»«» ( 1 ) 



with integral Mj , . . . , is called the lattice with basis Oj , . . . , a„ . We 
note that, since %, ...,a„ are linearly independent, the expression of 
any vector x in the shape (1) with real Mj, . . . , is unique. Hence if x 
is in A and (1 ) is any expression for x with real u^, then Mj , . . . , m„ 

are integers. We shall make use of these remarks frequently, often 
without explicit reference. 

The basis is not uniquely determined by the lattice. For let a] be 

the points , „ 

a. = 2 Vi, a, (i^t, j^n), (2) 



where v^j are any integers with 

det{Vii) 



± 1 . 



(3) 
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Then 



a. = 2 It',-, a' 
; 



( 4 ) 



with integral It follows easily that the set of points (1) is precisely 
the set of points 

u[a[ + ••• +w>' 



where u[, run through all integers ; that is Oj , . . . , a„ and a[, 

are bases of the same lattice. We show now that every basis a' of a 
lattice A may be obtained from a given basis a- in this way. For since 
o' belongs to the lattice with basis Oj, . . . , o„ there are integers such 
that (2) holds; and since o,- belongs to the lattice with basis a[, 
there are integers such that (4) holds. On substituting (2) in (4) 
and making use of the linear independence of the a-, we have 



[ 1 if j = / 
otherwise. 

Hence 

det(w;^) det(iiy,) = 1 

and so each of the integers det (w^j) and det (f^,) must be 1 ; that is 
(3) holds as required. 

We denote lattices by capital sanserif Greek letters, and in particular 
by A, M, N, r. 

If Oj, ...,o„ and oj, ...,a' are bases of the same lattice, so that 
they are related by (2) and (3), then we have 

det(aj, ..., 0 ^) =det(i;.,)det(ai,...,aj = ± det(Oi, ...,o„), 

where, for example, det(ai, ..., a„) denotes the determinant of the 
nxn array whose j-th row is the vector Uj. Hence 

d(A) =|det(ai,...,aj| 

is independent of the particular choice of basis for A. Because of the 
linear independence of Oj , . . . , a„ we have 

d(A)>0. 

We call i(A) the determinant of A. 

An example of a lattice is the set Aq of all vectors with integral 
coordinates. A basis for Aq is clearly the set of vectors 

( /—I zeros n— y zeros\ 

0 , ...,o, i,6yry7oj 



d(Ao) = 1 . 



and so 
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We note that the vectors of a lattice A form a group under addition : 
if aiA then — aeA; and if a, 6eA then od;b€A. We shall see later 
(Chapter III, § 4) that a lattice is the most general group of vectors in 
«-dimensional space which contains n linearly independent vectors and 
which satisfies the further property that there is some sphere about 
the origin o which contains no other vector of the group except o. 



1.2.2. Let ..., a„ be vectors of a lattice M with basis 6^, b, , 

so that vn . 

ai = Zvi,b.j ( 1 ) 

) 

with integers v^j. The integer 

7 = |det(y -)| — o«)| _ |det(o, n„)l 



|det(bj b„)| d(M) 

is called the index of the vectors a^, ...,a„ in M. From the last ex- 
pression it is independent of the particular choice of basis for M. By 
definition, 7^0; and 7 = 0 only if aj,...,a„ are linearly dependent. 

If every point of the lattice A is also a point of the lattice M then 
we say that A is a sublattice of M. Let Oj, ..., a„ and 6j, ..., b„ be 
bases of A and M respectively. Then there are integers such that (1) 
holds, since a,cM. The index of Oj, ...,c„ in M, namely 



7)=|det(i;,,)| 



_ |det(ai a„)\ 

|det(bi b„)| 



d{A) 

d(M) 



( 2 ) 



is called the index of A in M. From the last expression the index depends 
only on A and M, not on the choice of bases. Since a^, are 

linearly independent, we have D>0. On solving (1) for the b, and 
using (2), we have 

Dbi = l,Wija^, 
i 

where the are integers. Hence 



7)M C ACM, 



(3) 



where 7>M is the lattice of Db, beM. 

It is often convenient to choose particular bases for A and M so 
that (1) takes a particularly simple shape. 

Theorem I. Let Ale a sublaitice of M. 

A. To every base b^, ..., b„ o/ M there can be found a base a^, ...,«„ 
of A of the shape 

= ^1 1 ^1 

~ ^21 ^1 "F ^22 ^2 I 



— ^Hl^l + ^nnb„ , 

where the are integers and r',-,=|=0 for all i. 
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B. Conversely, to every basis of A there exists a basis 

6j , . . . , 6„ 0 / M such that (4) holds. 

Proof of A. For each i (1 ^i^n) there certainly exist points C; in 
A of the shape 

«, = \-Vi,bi 

where I'n, f,, are integers and since, as we have seen, 

Db, ~A. We choose for such an element of A for which the positive 
integer is as small as possible (but not 0), and will show that 
, . . . , o„ are in fact a basis for A. Since «i , . . . , a„ are in A, by con- 
struction, so is every vector 

(5) 

where w^, ... ,w„ are integers. Suppose, if possible, that c is a vector 
of A not of the shape (5). Since c is in M, it certainly can be expressed 
in terms of 6j , . . . , b„ , and so can be written in the shape 

c = f>i + ■•■+</, bjj , 

where /*=|=0 and t^, ....t,^ are integers. If there are several 

such c, then we choose one for which the integer k is as small as pos- 
sible. Now, since Vj.j,4=0, we may choose an integer s such that 

(6) 

The vector 

c-sai, = (f, - snii) bi -1 f- (4 - sv^i,) 

is in A since c and are; but it is not of the shape (5) since c is not. 
Hence sy,tj=|=0 by the assumption that k was chosen as small as 
possible. But then (6) contradicts the assumption that the non-zero 
integer Vf,^ was chosen as small as possible. The contradiction shows 
that there are no c in A which cannot be put in the form (5), and so 
proves part A of the theorem. 

Proof of B. Let Oj, ...,a„ be some fixed basis of A. Since Z)M 
is a sublattice of A by (3), where D is the index of A in M, there exists 
by Part A a basis Db^, . . . , Db„ of DM of the type 

Dbi = w^i Oj 
Dbj = -b ze'aaOa 

DK = w„^a^ + ■■■ +w„„a„ 

with integral w^j and Wii=b0 On solving (7) for a^, ...,a„ 

in succession we obtain a series of equations of the type (4) but where 
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at first we know only that the f,, are rational. But clearly bj, .... Z>„ 
are a basis for M and so the are in fact integers, since the a, are 
in M, and since the representation of any vector a in the shape 

a = /j &1 4- • • ■ + b„ (fj , . . . , , real numbers) 

is unique by the independence of b^, 

From this theorem we have a number of simple but useful corollaries. 
Corollary 1. In theorem I we may suppose further that 



vn>0 ( 8 ) 

and that 

0 ^ Vij < v.jj in case A , (9) 

0 ^ I’,,' < f, , in case B . (10) 

Proof of A. To obtain (8) it is necessary only to replace O; or b- 
by — a, , — 6, respectively if originally v, , < 0. To obtain (9) we replace 
the O; by 

where the are integers to be determined. For any choice of the t^- 
the a'i are a basis for A. We have 





where 

and, for/<f, we have 

+ ^i,?+i^7+i,7 + ■ ■ ■ + A-i,;' + 

For each i we may now choose fj.i-i, ..., in that order so that 

as was required. 

Proof of B. Similar. 

Corollary 2. Let be linearly independent vectors of a 

lattice M. Then there is a basis b^, ...,b„ of M such that 



®i ~ 1 ^1 

(I2 = ^2 j bi -j- L'2 2 b 2 



“m — H + 

with integers v,j such that 



(1 ^ / < 1 ^ 1”) • 



( 11 ) 
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VVe can choose vectors in M such that a^, are 

linearly independent. Corollary 2 follows now on applying Corollary 1 
to the lattice A with basis a^, 

Corollary}. Let Oj, ...,a„ {m<n) be linearly independent vectors 
of a lattice M. A necessary and sufficient condition for the existence of 
vectors ..., a„ in M such that a^, is a basis is the following: 

every vector c€M which is of the shape 

c=u^Oi + --- +u„a„ (12) 

with real Mj , . . . , necessarily has u^, ■■■ ,u^ integral. 

If Oj , . . . , a„ is part of a basis a^, ....an the condition is clearly 
satisfied. Conversely if a^, ...,«„ satisfy the condition, let bj, ...,5„ 
be the basis of M given by Corollary 2 and let v^j be the corresponding 
integers. Then c = b^, are of the shape (12) and indeed the 

coefficient of O; in the expression for 5,- is vf]^. Hence v,, = 1 and so 
v^^=0 for f=t=/, that is a—bj (Igf^w) and we may put a^^bj 
(m-i-i^i^n). 

In some contexts we shall need the following more specialized 
corollary which follows at once from Corollary}. 

Corollary 4. Let b^, ...,b„ be a basis for a lattice M and let 

c = «ibi4 \-u„b„cM. 

A necessary and sufficient condition that 

bi, . . . , b„,_j, c 

be part of some basis 

bi , . . . , b„,_i , c, . . . , c„ 

of M is that m„, .... m„ have no common factors 4= ±1. 

Proof. Clear. 

The following characterisation of the index of a sublattice A of a 
lattice M is sometimes useful. We say that two vectors c, d of M are 
in the same class with respect to A if c — d is in A. Clearly this is a 
subdivision into classes : if c — d and d — e are in A, then c — e is in A. 

Lemma 1. The index of the sublattice A o/ M is the number of classes 
in M with respect to A. 

For let a, , bj be bases for A and M respectively in the shape (4) 
given by Theorem I. Then clearly the index D of A in M is 

D = 

i 
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But now every cf M is in the same class as precisely one of the vectors 

as is readily verified (cf. proof of Theorem I, Corollary 1). 

1.2.3. There is a useful transformation of the criterion of Theorem I, 
Corollary 3 , for deciding whether or not a set of vectors a^, a,„ 
(m<n) of a lattice A can be extended to a basis for A. 

Lemma 2. Lei b^, be a basis for a lattice A and let 

«<= 2 ^ijbj (\^i^m) (1) 

l-Sj 

be vectors of A. A necessary and sufficient condition that a^, ...,a„ be 
extendable to a basis a^, a„ of A is that the mxm determinants formed 
by taking m columns of the array 

(ti,y) (1 ^ t ^ m, 1 ^ ^ m) (2) 

shall not have a common factor. 

The condition is certainly necessary. For let a„+i,...,a„ form a 
basis with a^, so that 

= 2 bj {m+\^i^n) ( 3 ) 

iS;S» 

for some integers t;,,-. Since a, (i^i^n) and 6; (i^i^n) are bases 
of the same lattice, we have 

det(i^,.^) = ±1. (4) 

We may expand the determinant (4) by the first m and last {n — m) 
rows [Laplace-development] and obtain 

2 KW,==det(v,,), (5) 

l^r^R 

where the are the determinants formed from columns of (2) 

and W, is the (n — m)x{n — m) determinant formed from the remaining 
(n — m) columns and the (n — m) rows, 

(m<i-^n, i^j<n), 

taken with an appropriate sign. Since the are integers, it follows 
from (4) and (5) that the have no common factor. 

The condition is also sufficient. For let c be a vector of A of the 
shape 

c = MiOi H h u„a„ 



( 6 ) 
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for real numbers u^, On inserting (1) in (6) we have 

V M.-i;,.,- = integer = (say) (7) 

l<iSm 



since bj, b„ is a basis for A. We may solve (7) for the m,, and 
indeed in a multitude of ways. For example let be the cofactor of 
in the expansion of the determinant 



Then 



Ti - det (ti,y) (1 ^ t ^ in, 1 ^ ^ m ) . 

2 v^h=^V,u„ 



so Mj is an integer. Similarly 



1^ M; = integer (t ^ i g i«) , 

where is any w X )w determinant formed from (2). Since, by hypothesis, 
the are integers without common divisor, the m, must be integers. 
Hence by Theorem I, Corollary 3 it is possible to extend Oj, to 

a basis a^, a 

1 . 2 . 4 ^. We shall now apply Lemma 2 to obtain a result of Daven- 
port (1955 a) about the way in which a basis for a lattice may be chosen. 
This will be used only in Chapter V, § 10 and then only to prove a 
result on Diophantine Approximation rather aside from the main theme 
of the book. 

Theorem II. Let A he an n-dimensional lattice, let 
C; (l^f^M — 1) 

be (« — 1) arbitrary real vectors and let e>0 he an arbitrarily small real 
number. Then for all real numbers N greater than a number depending 
only on A, e and the c,, there exists a basis a^, . . . , a„ of t\ such that 

|a, — Ac,;| < A' (l^f^w — 1). (1) 

Here 

\x\ = [xl + ■ ■ ■ i- xl)^ (2) 

denotes the usual euclidean distance. 

To prove Theorem II we shall need a result about the distribution 
of integers prime to a given integer. We prove this first, and then 
Theorem II. 

Lemma 3. For each d>0 there is a number k{b) with the following 
property: Every interval of length k{6)q^, where q is a positive integer, 
contains an integer prime to q. 

1 § 2.4 may well be omitted at a first reading 
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Let 

9 = n Pf, (3) 

iS)S/ 

where the are distinct primes and the a^>0 are integers. An integer 
is prime to q if and only if it is not divisible by any of p^, ...,pj. 
Consider some interval 

V<u^V+U (4) 

of length U, where U, V are fixed integers. Vot jy<j^<---<j^, where 

s^/, let 

be the number of integers u in the interval (4) which are divisible by 
pj^, pj^, ..., pj^ (and perhaps also by other primes from p^, ...,pj). We 
show next that 

W = U+Z[-\YM[n.....u) (5) 

s>0 

gives the number of integers u in (4) prime to q, where U is the number 
of integers u in (4). For let the integer u be divisible by precisely r 
primes pj, where say by pi,---,pr, but not by p,^^,...,pj. 

Then u is one of the integers counted in M(ji , . . . , /s) if and only if 

s^r and /j , . . . , is one of the Q combinations of s out of the numbers 

1 , 2, .... r. Also u contributes i to U regarded as giving the number 
of integers in (4). Hence the total contribution of m to (5) is 



1 






If, however, u is prime to q, then it contributes t to U but does not 
contribute to the M(/i , . . . , ; J ; so W is the number of integers in (4) 
prime to q, as asserted. But 

u 






K--Pi. 

since M(/j , ■ . . , is the number of integers 

« =pu---Piy’ 

where «' is an integer and 



< 1 . 



<m'^ 






Pu -Pu Pu-'-Pi. 

Since (5) contains 2-^ summands, we have 



w>u\\+ 2 ] 



(-1)^ 
s>o 

h<-<i, 

Cassels, Geometry of Numbers 



■ U[J (l - -2P^2~PU- 2L 




18 



Lattices 



Hence there is an integer prime to q in the required interval provided 
that 






If 6 is the arbitrarily small number given in the lemma, we now have 




(say) . 



where the second product is taken now over all primes less than 4'^'*. 
This proves the lemma. 

We shall use the lemma in the following apparently more general 
shape. 

Corollary. Let q, d, k(6) be as in the lemma and let s, t be integers 
of which t is prime to q. Then an interval of length greater than k{S)q^ 
contains an integer u such that tu-\-s is prime to q. 

For since t is prime to q we may write 



s =s-^t +s^q 

for integers and Sj. Then 



+ s = <(m + Sj) + s^q- 



Since t is prime to q we need only choose u so that « + is prime to q ; 
and this is possible by the lemma. 

We now revert to the proof of Theorem II. Let b^, be any 

basis for A, and let the given vectors c, be 





c.= 2 ya^i [\^i^n-\) 

1 ^ ; S » 


(6) 


for real numbers 


y^j. We shall choose a basis 








(7) 


/ 

for A such that 




,.. = Ary.. + 0(iV^), 


(8) 



where A^>t is the given positive number, d>0 is arbitrarily small, and 
the constant implied by the 0 symbol may depend on n, d and the y,y. 
We shall choose the so that for each I<n the two integers 

i?y = det(v,y) 1^/^/) 

and 

Sj = det(v,,) (f^f^/, 2^/^/ + 1) 

are non-zero and without common factor. 

Suppose, first, that I =i. We take for one of the integers nearest 
to Nyii which is not 0. Next we choose for Vj 2 the integer nearest to 
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Nyi 2 which is not 0 and prime to tin. For j>2 we choose for v^j the 
integer nearest to Ny^j. Then (8) holds for i = l and /#=2 trivially 
and for i = \, j =2 by Lemma}, and since clearly Vn = 0{N). The 
integers R^ = Vn and Si=^Hj 2 have the required properties. 

Now suppose that I>\, and that the with i<I have already 
been constructed. For /=t=/. I + \ we take for Vjj just the nearest 
integer to Nyjj. On expanding R, and by their last rows, we now 
have 

•^7 = i ^ . 

■^7 = i *^7,7+1-57-1 + *'/7^ + 

where A, B,C are integers which have already been determined. Since 
f?/_i is prime to Sy_i, we may choose the integer Vjj so that R, is not 0 
and prime to We choose for Vjj the integer nearest to Ny^j for 

which this is true, so that, by the corollary to Lemma }, 

Vu-Nyjj = 0(SU)=0{N^'-^^^), 

since S^_i = 0(iV*“*), being a sum of products of 7 — 1 numbers each 
of order N. Having determined Vjj we now take for the integer 

nearest to iVy/,/+i such that Sj is not 0 and prime to Rj, so that 
similarly 

t'Li+i 77,7+1 = 0(S?)=0(iV^^). 

This completes one stage of the induction. We have thus shown the 
existence of integers satisfying (8). 

From (7) and (8) we have 

|a,. — Nc,-| =0(A^<”“b<S) — 1). 

The truth of the statement of the theorem now follows on taking d = ejn. 

1.3. Lattices under linear transformation. It is convenient here to 
consider briefly the effect of a non-singular affine transformation 
x~^X = ax of ^-dimensional space into itself. Let the transformation 
X — ax be given by 

^i= Z ( 1 ) 

where 

X={X^,...,X„), x=^ixi,...,x„) 

are corresponding points in the transformation and a,y are real numbers 
such that 

det(o) = det(a,y) =t= 0. 

Let A be a lattice and denote by a A the set of points ax, ®€A. 
If , . . . , b„ is a basis for A, then the general point b = m, 6^ -| h b„ 

2 * 
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(% , integers) of A has the transform 

a6 =o(«i6iH \-u„b„) =«iabiH \-u„ab„. 

Hence a A is a lattice with basis ab^, ...,ab„, and 

(a A) = I det (a bj , . . . . a 6„) I = I det (a) 1 1 det (bj , . . . , 6J | = | det (a) j tf (A) . 

We note two particular cases. First, if /^=0 is a real number, then 
the set of /b, be A is a lattice of determinant |f|”(f(A) which we shall 
denote by / A. Secondly every lattice M can be put in the shape M = a Aj, , 
where a is of the type (1) and Aq is the lattice of integer vectors. For 
if Oj, ..., a„ is any basis for A, we may define a^j by 

Qj (^ 1 ; t • • • 1 ^nj) • 

1.4. Forms and lattices. We consider first quadratic forms. Let 

n 

/(*) (/./ = /;/). ( 1 ) 

where 

a; = (^i,. (2) 

be a non-singular quadratic form of signature^ {r, n — r)\ that is, there 
exist independent real hnear forms 

Xi= ^ (3) 

such that identically 

n^)=<p{X). (4) 

where 

X = (X„...,X„) (5) 

and 

-f XI (6) 

(for r=0,n there are no positive or negative squares respectively) . 
We have clearly 

det(/,,) = ±{det(i,,.)P. (7) 

Conversely, if is any set of real numbers with det(i„) =)=0, then (3), 
(4) and (6) determine a quadratic form (1) of signature (r,n — r) and 
(7) holds. We shall be concerned a great deal with the values which 
f{x) takes when Xj, ..., are integers. By (3), these are precisely the 

^ Many writers define the signature of a form to be the number of positi\ e 
squares less the number of negative squares in (6), i.e. 2r — n. But it is more 
convenient to give e.xplicitly the number of each kind of square than to do the 
arithmetic every time. 
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values which (p{X) takes when X runs through the vectors of the 
lattice A with basis 

~ ■■■ > ^nj) ■ 

Then, by (7), we have 

{i(A)}2 = ldet (/,,)! . (8) 

In this way statements about different quadratic forms of signature 
{r, n — r) at integral values are equivalent to statements about the 
single form (p[X) and different lattices. For later reference we formulate 
a typical result as a Lemma. 

Lemma 4. The following four statements about a number x are equi- 
valent, where 

ep{X)=X\ + ---+X^-XUv Xl 

(i) In every lattice A there is a vector A=^o with 

\cp(A)\^x{d{A)ri’'. 

(ii) In every lattice A of determinant 1 there is a vector A^o with 

\q>{A)\^K. 

(Hi) In every lattice A of determinant d(A)^x~’'^^ there is a vector 
AH=o in 

\(p(A)\^i. 

(iv) For every quadratic form signature [r,n — r) there 

is an integer vector a=j=o such that 

\f{a)\^y.\dtnU,)Yl\ 

That (i), (ii) and (iii) are equivalent follows from homogeneity, since 
q>{tX) =t^<p[X) and since the set tA of all tX (X^A) is a lattice tA of 
determinant |f|”i(A); and we may choose t so that f'd(A)=i. That 

(iii) and (iv) are equivalent follows at once from the earlier discussion 
and, in particular, from (8). 

The foregoing argument is quite general. For example the behaviour 
for integer values of the variables of any form f{x) of degree n which 
can be expressed as the product of n real linear forms: 

fi^) = TT + -\-dj„x„) 

is equivalent to the behaviour of the function 

(p{X)=X,...X„ 

at the points of an appropriate lattice A. A single function q>{X) cor- 
responds to the set of all functions f{x) that can be deduced from it 
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by a real non-singular affine transformation 

X. = 2 ; real, det (<f„) 4= 0) . 

1.4.2. Of course the form (p{x) and the lattice A do not determine 
the function f{x) uniquely, since f{x) depends on the choice of a 
particular basis for A; and we shall discuss this ambiguity here. The 
transformation , 

i 

of § 4.1 is just of the type 

X = ax 



discussed in § 3. Identifj'ing these transformations we see that 

A = a Ao , 

where Aq is the lattice of all integer vectors; the particular basis 



di,...,d„ 

of A corresponding to the basis 

e, = 1, (\£j^n) 

of Aq. Hence any other basis dj , . . . , <i' of A is of the shape 



d' = ae;. 



where e' is some other basis for Aj. Let /' be the form corresponding 
to the basis d'j as / does to d^. Then clearly there is the identical relation 

f'(x') =j'{x[,...,x'„) =(p{x[d[^ \-x'„d’„) =f{x[e'i + --- + x'„e’„) . 



But now since e' is a basis for we have 



e] = (vj,., . . . , v „,) , 
where the v^j are integers and 



so that identically 
if 



det(v;,.) = ±1 : 

/'(*')=/(*) 

Xi = ZVijX’,. 

7 



( 1 ) 

( 2 ) 

(3) 



Conversely, if the are integers such that (1), (2), (3), hold then /' 
and / correspond to the same lattice A. Two forms in this relationship 
are said to be equivalent; they take the same set of values as the 
variables run through all integral values, since, by (1) and (3), integral 
x' correspond to integral x and vice versa. 
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It is sometimes useful to distinguish between det (i;,y) = + 1 (proper 
equivalence) and det(i<,^) = — t (improper equivalence) in (1). We shall 
not do this, however, since it does not correspond to anything intrinsic 
in the corresponding lattices. 



1.4.3. The forms f{x) and <p{X) do not in general determine the 
lattice uniquely, since for e.xample a quadratic form f[x) of signature 
{r, s) with r + s = M may be expressed in the shape 

Xl + --- 

in many different ways. Let aj,...,a„ and 5i,...,6„ be bases of 
lattices A and M respectively and suppose that 

95 = (1) 

for all integral «< = («!, . . . , u„). Since ^(X) is a form, (1) is an identity 
in the variables % , . . . , m„ . Let ca be the uniquely determined homo- 
geneous transformation such that 



Then 

for all u, and so 

cp{X)=<p[ii,X) (2) 

for all A', by (1) and since every vector is of the shape X='^Uja^ 
for some real numbers Uj. If the homogeneous transformation u) 
satisfies (2) we call it an automorph of 99. We have just shown that if 
(1) holds there is an automorph to of 9) such that u>a~bj. The con- 
verse is, of course trivial that if to is an automorph of 99 and toOy=bj 
then (1) holds. 

We shall study the automorphs of forms intensively in Chapter X. 



1.5. The polar lattice^. We denote the scalar product of two n- 
dimensional vectors *, y by 

acy=%yiH \-x„y„. (1) 



Let bj , . . . , b„ be a basis of a lattice A. Since the by are linearly 
independent, there exist vectors bf such that 



bfb. 



1 if i = j 
0 otherwise . 



( 2 ) 



1 This section will not be referred to until Chapter ^TII and will not be of 
importance until Chapter X and XI. 
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The lattice A* with basis b* is called the polar (or dual or reciprocal) 
lattice of A, and b* is the polar basis to bj. The polar lattice A* of A 
is independent of the choice of the particular basis, as we now show. 

Lemma 5- The polar lattice A* oj A consists of all vectors a* such 
that a* a is an integer for all a in A. Then A is conversely the polar 
lattice of f\*. Further, 

d[^)d[^*) =1. 

Suppose, first, that 

«* = Z K ' « = Z *»;■ 

are in f\* and A respectively, so that the Uj, Vj are integers. Then 

o*a = 2 

is an integer. Now let c be any vector such that ca is an integer for 
all a in A. In particular 

c by = Uj (t ^ ^ n) 
is an integer. Put a*='^Ujbf. Then 

(c — a*)bj = 0 (t^/^«); 

and so c=a* since the by are linearly independent. This proves the 
first sentence of the theorem. The second sentence follows immediately 
from the first and also from (2). Finally, (2) implies that 

det (b*, . . . , b*) det (bj , . . . , b„) = 1 , 

and so d(A*)d{A)=i. This concludes the proof of the lemma. 

1. 5.2. When j/=)=o is fixed, the points x such that yx=0 he in a 
hyperplane through o. 

Lemma 6. A necessary and sufficient condition that there he n — i 
linearly independent points Oj , . . . , a„ _ j in A with y a, = 0 ( 1 ^ ^ w — 1 ) 
is that y = ta* for some real t and some a* in A*. 

Suppose first that ya^ = Q — 1). Then by Theorem I 

Corollary 2 there is a basis b, for A such that 

a.- = fii bi + • ■ ■ + bj {vn =b 0) 

for integers Vyy. Hence yb~0 — 1). Let yb„=t. Then 

clearly y = tb* where b* {i^j^n) is the polar basis to by. This proves 
half the lemma. 

Suppose now that y = ia*, where a* € A*. If a* = 0 there is nothing 
to prove. Otherwise, a* = mb*, where w is an integer and b* is primitive*. 

' That is, not of the shape mc*. c* 6A* for an integer « > 1. 




The polar lattice 



25 



Then b* can be extend to a basis b* for A*. Let bj be the polar basis. 
Then 

y bj = mtb* bj = 0 (2 ^ ^ . 

This concludes the proof of the lemma. 

Let A (a*) be the set of a in A such that a*a—0. Clearly if Oj and 
U 2 are in A(a*), so is for any integers u^, u^- By Lemma 6 

if a* ( A* there are n — i linearly independent points of A(a*), and so 
in a sense A(a*) is an [n — t) -dimensional lattice. The following corollary 
makes those remarks more precise. 

Corollary. Let b* = (b*, be a primitive point of A* and 

suppose that b*=^0. Then the set of [n ~ \)-dimensional vectors a' = 
(«! , . . . , a„„i) such that for some the vector a = (a^, is in A and 

satisfies b*a=0 is an {n — \)-dimensional lattice M of determinant 
d{\A)=\bt\d{A). 

We note that M is the projection on x„ = 0 of the set A(5*) just 
defined. Since 6*=j=0, if a„ exists it is uniquely determined by .... 
and the condition b*a=0. 

We may suppose that b* = b*, where bf, ..., b* is a basis for A* 
and bj is the polar basis. After what was said before the enunciation 
of the corollary, it is clear that the {n — 1) -dimensional vectors b' formed 
by taking the first n — \ coordinates of bj are a basis for M. We now 
show that 

bt det (bi , . . . , b„) = det (b( . (1 ) 

If in the determinant in the left the «-th coordinate x„ is replaced by 
b*x for a5 = bi, ..., b„, the value of the determinant is multiplied by 
btn=b*. Since b*bj=0 for i<j^n — 1 and b*b„=t, the equation (t) 
follows at once. In particular |6*| d{A) =d{M), as required. 

1.5.3. Finally we must note the effect of homogeneous linear trans- 
formations on the relationship between polar pairs of lattices. Let 

X = TX ( 1 ) 

be a non-singular homogeneous linear transformation given by 

i 

where 

det(T) = det(T,y) =k 0. 

If F is any vector, we have 

YX = ZYjX, = ZY,XjjXj. 
i i, i 



( 2 ) 
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Hence 

YX = yx, ( 3 ) 

where 

= (\^ j £ n ). (4) 

J 

Since det (t) 0, by hypothesis, the equations (4) define 1" as a function 

of y. We write 

Y = x*y, 

where t* is called the transformation polar to t. 

Lemma 7. Let x be a non-singular homogeneous linear transformation, 
A a lattice, and tA the lattice of points tx, xe/\- Then the polar lattice 
of x/\ is t*A*, 10 here t* and A* are respectively polar to x and A. 

This follows at once from Lemma 5 and equation (3) above, where 

X =TX, Y =T*y. 



Chapter II 

Reduction 

11.1. Introduction. In investigating the values taken by an algebraic 
form f(x) for integer values of the variables it is often useful to sub- 
stitute for / a form equivalent to it (in the sense of Chapter I, § 4) which 
bears a special relation to the problem under consideration. This process 
is independent of the geometrical notions introduced by Minkowski 
and depends only on the properties of bases of lattices developed in 
Chapter I. Indeed only the lattice Aq of integer vectors comes into 
consideration. 

It is convenient to collect together in one chapter the various 
applications of reduction. The later parts of the chapter involve some 
moderately heavy computation. The beginner might well omit all after 
the enunciation of the results in § 4.2. Indeed the next few chapters 
are practically independent of Chapter II, which might well have been 
deferred until later. 

In § 2 we discuss the general method. In the rest of the chapter 
we shall be mainly occupied in investigating 

M(/) = inf|/(u)| 

u=t=o 
H integral 

where f{x) is a form of a special type. Definite and indefinite quadratic 
forms are treated in §§3-4 respectively and binary cubic forms in § 5- 
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The methods of this chapter have been successfully applied to 
related problems ; for example, when / (x) is indefinite, to the estimation of 

inf /(u) 

over integer vectors M=j=o for which /(u) is positive [either in the strict 
sense /(m)> 0 or the weak sense /(u)^0: two distinct problems in 
general] but we shall do this only for binary forms. 

A table listing the known results about quadratic forms is given in 
an appendix. We shall be considering quadratic forms later from other 
points of view. 

Davenport and Rogers (1950a) have shown that in many cases 
not merely one but infinitely many integer points u exist such that 
/(u) satisfies the inequalities stated. This requires deeper methods than 
those used here and will be discussed in Chapter X. 

It should be remarked that there is a classical theory of reduction 
for indefinite binary quadratic forms which we do not discuss here. 
Although it comes into the general scope of reduction as defined here, 
that is the choosing of bases with special properties, it is best under- 
stood after the discussion of Chapter III. It is closely related to con- 
tinued fraction theory. See Chapter X, § 8. 

II. 2. The basic process. We first discuss the standard procedure 
for positive definite forms f{x)] that is for forms such that f{x)>0 
for all real vectors x=|=o. 

We note first that if f{x) is positive definite of degree r, say, then 
there is a constant y.>0 such that 

f{x)^y.\xY (1) 

for all real x, where we have written 

[a?! = [x\ x'Y-. 

For on the surface of the sphere |a;| =1 the continuous function f{x) 
must attain its lower bound y, so y>0', and then (1) follows by homo- 
geneity. In particular, there are only a finite number of integral 
vectors* u such that /(u) is less than any given number. 

We now choose a basis for the lattice Aq of integral vectors with 
respect to the positive definite form f{x) as follows. Let e]=h0 be one 
of those integral vectors u for which /(u) is as small as possible. By the 
argument of the last paragraph such u exist, and there are only a finite 
number of them. If e[ were of the shape e[—ka, o£Aq, where ^>1 
is an integer we should have 

0</(a) =k-'fle[) <f{e[), 

* i.e. vectors whose co-ordinates are rational integers. 
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contrary to the definition of . Hence by Corollary 3 to Theorem I of 
Chapter I, we may extend to a basis e(, bj, •••, b„ of the lattice Aj 
of integer vectors. We now choose e' in succession. Suppose 

that ej , . . . , e'_i have already been chosen and are extensible to a base 
ej, ..., ey_i, 6,, .... 6„ of A#. Then e' is one of the finite number of 
vectors with the property that e[, e' is extensible to a base of Aq 
and for which /(e') is as small as possible. Such e' exist but are finite 
in number, by argument used for e[. In this way we obtain a base 
e[, and for any given f{x) there are only a finite number of such 

bases. 

If the function f(x) is such that we may indeed choose 

e' = e.- = (0, . . . , 0, 1 , 0. . . . . 0) (f ^ ^ n) 

for the above basis, then f{x) is said to be reduced (in the sense of 
Minkowski). The above proof shows that every positive definite form 
is equivalent (in the sense introduced in Chapter I, § 4 ) to at least one 
and to at most a finite number of reduced forms. 

We may make the definition of a reduced form more explicit. By 
Corollary 4 to Theorem I of Chapter I (or by Lemma 2 of Chapter I), 
a necessary and sufficient condition that Cj, ...,e^_j and the integral 
vector u = (mj , . . . , «„) be extensible to a basis for Aj is that 

g.c.d.(«, ■,..., M„) =1. (2) 

Hence the forip f{x) is reduced if and only if 

/(«!,...,«„) ^/(e,.) 

for all j and for all integers u^, satisfying (2). 

II. 2 . 2 . When the form f{x) is not definite, then there is no generally 
valid procedure to replace the reduction procedure for definite forms. 

If we know (or may assume) that /(u) does not assume arbitrarily 
small values for integral u=f=o then it is possible to salvage something 
of the reduction procedure. Let £>0 be chosen arbitrarily small. By 
hypothesis. 

Ml = inf |/(m)| > 0. 

U^O 

integral 

Hence we may find an integral e^=)=0 such that 

|/(e;)|^Mi/(l-c). 

Without loss of generality e[ is not of the form ka where a is integral 
and ^>1 is a rational integer. If ej , . . . , e'_i have already been found. 
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write 

M, = inf|/(u)l 

where the infimum is over all integral vectors u such that e[, e'_, , u 

is extensible to a basis for Aq. Then 

and so we may choose e' so that e[, . . . , e' is extensible to a basis and 

Let f'(x) be the equivalent form for which 

/(«:)=/'(«;)■ 

Then we have 

|/'K, ^ (1 - £)|/'(e,)| 

for all sets of integers Mj , such that g.c. d. {u^, 1 . But, 

of course, there is no reason to suppose that there are only a finite 
number of /' with this property and equivalent to a given /. 

An alternative procedure which is sometimes possible is to find some 
other form g (x) , related to our given /, which is definite and to reduce 
g{x). We shall do this for binary cubic forms in § 6. This method goes 
back to Hermite, who applied it to indefinite quadratic forms as follows. 

Let f{x) be an indefinite quadratic form of signature [r,n — r], so 
that, as before, 

/(X) =X? + ... Xl, (1) 

where the Xj are linear forms in Xj , . . . , . Then 

g(x)=X\ + ---+X^,+XU,+---+Xl ( 2 ) 

is a definite quadratic form with the same determinant, except, perhaps, 
for sign. The forms Xj, ..., are not uniquely determined by /(x) 
but we say that /(x) is reduced (in the sense of Hermite) if the form 
g (x) is reduced in the sense of Minkowski for some choice of Xj , . . . , X„ . 
Clearly /(x) is always equivalent to a reduced form, since we may choose 
any representation (1) and then apply the transformation which reduces 
g(x). Reduction more or less of this kind was first introduced by 
Hermite, and has been further discussed, amongst others, by Siegel 
(1940a), as a tool for investigating the arithmetical properties of quad- 
ratic forms. In general a form /(x) is equivalent to infinitely many 
HERMiTE-reduced forms, but Siegel shows that it is equivalent to only 
finitely many if the coefficients of /(x) are all rational. 

We note here that the relationship between (1) and (2) allows 
estimates for the minimum of a definite form to be extended to an 
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indefinite one, since clearly |/(a;)| ^g(as) for all real vectors x. But in 
general, the information so obtained is quite weak. 

II. 3. Definite quadratic forms. We shall be considering definite 
quadratic forms from many different points of view in the course of 
this book. Here we see what can be done by reduction methods alone. 
The study of reduction is of great importance in the arithmetical theory 
of quadratic forms, see Weyl (1940a) or van der Waerden (1956a), 
who give references to earlier literature. Here we are concerned only 
with the minima of forms. 

Let 

/ (•*! I ^ 2 ) ~ f 11^1 T 2/j 2 .^1 ^2 A 2 ^2 

be a positive definite quadratic form. We wish to prove that there are 
integers (Mj , M 2 ) 4= (0, 0) such that 

/(Ml, M 2 ) ^ (4Z)/3)% 

where 

D = /ll/2 2~/l2- 



By taking an equivalent form, if need be, we may suppose that 
M{f) = inf f{u^,u^) =/„. 

U), 1<2 

integers not both 0 

We have 

\ /ll / /ll 



Put M 2 = 1 and choose for Mj an integer such that 





M] 4- 


A 2 ^ ' 
A. = 2 ■ 


(1) 


Then, on the one hand. 


/(«! 


.f)^/ii> 




and, on the other. 


/(«i.I) 


< ^ / 4- ^ 

a7- 


(2) 


Hence 


D 

7iT 


^ ~/n. 




that is 




Q 

VII 





as required. That ^ here cannot be replaced by < is shown by the 
form 

/o(y,, Y2) = A'l 

for which = f and / (Mj , Mj) ^ 1 for integers (Mj , u^) 4= (0, 0) . It is not 
difficult to show by examining when equality can occur in the above 
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argument that ^ can be replaced by < unless / is equivalent to a 
multiple of /q. We do not go into details, since we shall prove this 
later more simply. 

II. 3. 2. As Hermite noted, this argument can be extended to prove 
the following theorem. 

Theorem I. A non-singular quadratic form 

f[x) 

represents a value f(u) with 

|/(u)l^(f)("-i>/^|Z)|>/", (1) 

where u o is integral and 

Z3=det (/,,). 

By the remarks at the end of § 2.2 we may suppose, without loss 
of generality, that f{x) is positive definite. We may now suppose, as 
before, that 

for all integral u =|= o. Then 

j(x) + .^^2 + 

' /ii hi / 



where glx^, ..., v„) is a definite quadratic form of determinant Dlf^. 
Since we may suppose the result already proved for forms in m — 1 
variables, there are integers u^, ...,m„ not all 0 such that 



g(W2. •••. « 



= \/n/ 



Choose the integer rt, so that 



Then 



and so 



«i + 



hi 



: + 



+ ArfK 

I 1 

hi 



< 



1 

2 



iti- 



4 D 

Jl I7u/ 



/ll^ 



4 

y 









This proves the assertion. Unfortunately, the constant (1)'^*"“'* is the 
best possible only for n = 2. We shall show below that it is not the 
best possible for w=3, and since the above proof is by induction it 
cannot be best possible for w^3- If is possible to modify the argument 
to give the best possible result for w = 3 [for a neat version of this see 
Mordell (1948a)], but we shall not do this. Instead we give a more 
elegant, if more artificial, treatment depending on a more detailed 




32 



Reduction 



examination of reduced forms which goes back essentially at least 
as far as Gauss. 

II. 3.3. We start with the consideration of a positive definite binary 
form which is reduced in the sense of Minkowski: 

/(xi.Xjj) = hixl 

After the substitution x^-^x^, x^-^ — x^ if need be, we may suppose 
without loss of generality that 

/l2= 0- (1) 

By the definition of reduction, 

/22=/(0,1)^/(1,0) (2) 

and 
that is 

2/i2=/ii- (3) 

By (1), (2), (3) we have 

40 — 3/11/22 “ /11/22 4/12^ /ii 4 /? 2 = Ol 

and so 

/hS/il/22^fO. 

The sign of equality is required only when fii=f 22 — '^fi 2 '> i-®- when 

/(*) =fll{xl+Xj^X2 + xl). 

Before going on to ternary forms, we note that any form satisfying 
(1), (2), (3) is reduced. This is a special case of the general theorem 
that MiNKOWSKi-reduced forms can be characterised by a finite set of 
inequalities, but here it is easy to verify directly. 

Let Mj, be integers neither of which is 0. If |mi|S|« 21 we have 

/(mj, Mj) = I ||/i j I MjI rb 2/12 1 M2I} + /22M2 

= I ^l| {/ll I **ll ~ 2 /i 2 I Mil} + /22*^2 

~ ^i(/ii 2/12 ) T" 
^/i1-2/i2+/22=/(-G1): 

and if 0 <|mi|^|m 2| the same inequality follows on reversing the roles 
of Ml and M2. Since /n— 2/12 4-/2 2^/2 2. by (3), we have shown that 
f{x) is reduced. 

In particular, if t is any number then the form 
/( = *?+ Xi X 2 +(( + i) xl 



is reduced. Since 
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and the determinant D of /< is t, we see that 



M{f)IDi 

may take any value This is in striking contrast with the 

behaviour of indefinite quadratic forms (see § 4 ). 

For later convenience we collect what has been proved so far and 
express it as a theorem. 

Theorem II. A positive definite binary quadratic form 
/i 1 ^1 “h 2/j 2 Xj Xj -(■ /a 2 ^2 

is reduced if and only if 

1 2/ial ^ /ii ^ fa- 

The three smallest values taken by f{u) for a reduced form and integral 
u 4 =o are /u,/22 /11-2I/12I +fii. where 

/n ^ /a2 = /ii ^ 2I/12I +/22- 

For a reduced form 

/11/22 ^ 4 T */3 1 

where 

^ = /n/22 /l2- 

The ratio Q=fnlD^ 'may take any value in the interval 






1 1 . 3. 4. We now consider ternary quadratic forms. As we shall 
later be considering definite quadratic forms in a wider context 
(Chapter V, § 9 , see also Chapter IX, § 3 . 3 ) we content ourselves with 
the following. 

Theorem III. A. Let 

/(*) (/i; = /;.) 

be a positive definite ternary quadratic form. Then there is an integral vector 
u^o such that 

f{u) ^ (2D)K 

where 

Z)=Z)(/)=det(/,,). 

B. If f[x) is reduced, then 

/n/22/33^ 2 i). 



Casseis, Geometry of Numbers 
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C. The signs of equality are required, when and only when f{x) is 
equivalent to a multiple of 

fo(x) =xl+xl+xl+XiX3 + XgX^ + x^x^. 

We note again that we get as good an estimate for /u/22/33 as we 
do for /ii. This will be put in a wider setting in Chapter VIII, § 2. 

Since /,(«) is an integer we have fg{u)^\. Since D[f^=\, this 
shows that the equality signs are required for /q . Part A of the theorem 
follows from the rest. Hence we need only prove Part B and that 
equality in B occurs only for multiples of /j. 

Following Gauss (1831a) we distinguish two cases. Suppose first 
that 

fiif%zfzx= 0 . 

Then after a substitution 

Xi^±Xi 



we may suppose without loss of generahty that 

/l2^0, /23^0, /si^O. 



Write 

Then 






for all i q=; since / is reduced. For example 

/(I, -1,0)^ /(1, 0,0) 

gives «?2i = 0- We have identically 

2D A 1 ^2/33 “ '^32^21^13 + 2 {A 1/23^23 + /23^13^2l}> 



( 1 ) 



( 2 ) 



where the sum is over cyclic permutations of 1,2,3; as is readily 
verified on expressing both sides in terms of the A> alone^. Since all 
the terms on the right-hand side of (2) are non-negative, we have 

/ll = Ai/22/33 ^ 2Z), (3) 

as required. 

The other case is when fuf^zki'^^’ then we may suppose that 



A2^0> /23=0» Ai^O. 

We write now 

fa = fa + Vii 

and 

ft), = /(l,l,l)-Ai- 

^ This is an application of Littlewood’s Principle: all identities are trivial 
(once they have been written down by someone else). 
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Then since / is reduced. Then identically 

60 — 3/11/22/33 ~ W23V31W12 + 

+ 2%2V'l3V’21+Ii{/ll(-/23) (v'23 + 2ct)i) +(-/23 )V'i3 V ’2 l} • 

Again all the terms on the right-hand side are non-negative, so (3) holds. 

We leave to the reader an examination of when equality can occur. 
A rather tedious investigation of cases shows that it can occur only 
when 

fll— f22 — / 3 a 

and either 2/23=2/31=2/12 = it, or one of 2/23, 2/31, 2/12 vanishes and 
the remaining two are equal to i 1 . But all these forms are equivalent 
to fiifo(x), as is readily verified. For example, 

*1 i^2 +Xl i%^2i«2^3=/o(%>^2+^3. ~ • 

Gauss lists several other identities which could be used instead of 
those here. 

II. 4 . Indefinite quadratic forms. These will also be considered 
again and again throughout the book from different points of view. 
A table listing known results is given in Appendix A. We do not here 
carry the reduction argument as far as it will go, but only far enough 
to illustrate the different nature of the results from those obtained in 
the definite case. 

We shall continue to use the notation 

M{f) =inf |/(u)|, 

U #:0 

integral 

where f{x) is a form in any number of variables, and write 

Z)=i)(/)=det (/,,.) 

for a quadratic form 2 = /(*)• 

There are two characteristic differences between the behaviour of 
M(f) for definite and indefinite forms. For definite binary forms we 
saw that M(/)/|Z)(/)|^ could take any value q in an interval 

where (f)^ was the maximum possible value. It is not difficult to verify 
that definite quadratic forms in any number of variables behave simi- 
larly, cf. Chapter V, Lemma 6. The first difference in the behaviour 
of indefinite quadratic forms is rather trivial: it is quite possible that 
M{f) =0, and this may occur either because there is an integral M=)=o 
such that /(m) =0, or because there are integral u=f=o such that f{u) 

3 * 
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is arbitrarily small but not 0. The second difference is deeper: the values 
of Af(/)/|D(/)|l do not fill the complete interval up to the maximum 
possible value. 

The position for indefinite binary quadratic forms has been the 
most investigated. Here a very great deal is known about the possible 
values of M(/)/|D(/)|l. The greatest value is {fj\ given by the multiples 
of xl-\-XiX 2 — xl. Otherwise M(f)^{^)^D{f)\K A well-known theorem 
of Markoff (“the Markoff chain”) states that there are only de- 
numerably many possible values of M(/)/|D(/)|* greater than f . There 
are certainly intervals to the left of f which contain no values of 
M(f)l\ D (/)|^. The author has given a proof of the Markoff chain theorem 
in his Cambridge Tract [Cassels (1956a)], to which the reader is referred 
for references for the various statements made in this paragraph. Here 
we shall be content with finding the two largest possible values of 
M(f)l\D(f)\K 

There is a similar state of affairs for ternary quadratics but much 
less is known. The most complete information is due to Venkov (1945 a) 
who has found the eleven largest values of M(/)/| Z)(/)|l, but they do 
not seem to follow any general pattern, except that they are all given 
by forms with integral coefficients. There are two unsolved problems 
about indefinite ternaries which appear completely intractable. It is 
not known whether there are forms / with M{f)>0 which are not 
multiples of integral forms; and it is not known whether the set of 
values of M{f)l\D{f)\^ has any limit point other than 0. These two 
problems are closely related [Cassels and Swinnerton-Dyer (1955a); 
see also Chapter X, Theorem XII]. 

This phenomenon of "successive minima" (not to be confused with 
the "successive minima” of a lattice with respect to a point set which 
is discussed in Chapter VIII) occurs very widely with indefinite forms. 
It takes a great many different shapes and a general theory hardly 
exists*. It is not possible to predict when it occurs: for example it does 
not occur in the problems discussed in § 4.5 or § 5. 

It is not difficult to see how "successive minima” can occur. An 
inequality of the type |/(u)|^l, where f{x) is an indefinite form and u 
is an integer vector, is really a pair of alternatives 

either / (u) ^ 1 

or /(u) ^ — 1 . 

Each of these inequalities may be regarded as a hnear inequality in 
the coefficients of /. If we consider a large number of different u then 

* Mahler has shown that the minima form a closed set. In fact this follows 
at once from his compactness theorem of Chapter V. 
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the various pairs of alternatives are a priori independent. It may turn 
out, on combining the various alternatives, that some combinations of 
alternatives are altogether impossible while other combinations of alter- 
natives define a form / uniquely. An example may make this clearer. 
Suppose that we are interested in binary quadratic forms for which 



M(/)=1 and /(1,0)=1. 
Such a form has the shape 

( 1 ) 

where the coefficients a 
and are to be investi- 
gated. The only such form 
which satisfies the ine- 
qualities 

/( 0 , 1 ) ^- 1 , 

/(I, I) ^+1, 



/W = 

xl+a.x^x^ + ^x\. 




/(2, -1)^ -M, 



Fig. 4 



is x\ x^x^ — x\, as the reader will easily verify. Hence any other form 
with / (I, 0) =1 and M{f) =\ must satisfy at least one of the inequalities 
/(O, 1)^ -ft, /(t, 1)^ — 1, /(2, —1)^ — 1 . The form x\-\- x^x^ — x\ is 
thus in a strong sense isolated from all other forms (1) with M(f)=\. 
For example if a and ^ are plotted as cartesian coordinates for the 
form /, a condition \ \ excludes a strip of the plane between 

two parallel lines. The three conditions 



1 /( 0 , 1 ) 1 ^ 1 , 1 /( 1 , 1 ) 1 ^ 1 , 1 /( 2 , - 1 ) 1^1 

exclude three strips. What is left consists of the point (1, —1) and a 
number of infinite regions which are separated from the point by one 
of the strips (see Fig. 4). 

In the actual proofs, this general principle tends to be obscured. If / 
is an indefinite form and M{f) = 1 there is not necessarily an integral 
vector u with |/(u)]=l, though there are integral vectors with 
1 ^ I / (w) 1 < 1 -f £ for any given £> 0, and further devices must be used 
to deal with this. The difficulty is that if / > 1 , then the form tj{x)= f'[x) 
satisfies the same choice of inequalities “/(u)^1 or /(m)^ — 1” as the 
original f{x). Here t might be arbitrarily close to 1, that is, the coef- 
ficients of f'(x) might be arbitrarily close those of f(x). Hence to pin 
down f(x) uniquely we must some-how make use of the normalization 
M{f)=\. We do this by first finding the determinant of the form in 
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question and then using this as part of our information. The actual 
proofs will make the details clearer. 

We shall later deal with isolation of this type from a more sophisti- 
cated point of view (Chapter X) . The treatment there will also help to 
show why the additional devices just mentioned are effective. 

1 1. 4. 2. The problem of the minimum of indefinite binary quadratics 
has already been discussed in §4.1. All we shall actually prove here 
is the following. 

Theorem TV. Let 

/(*) =/ll^f + 2/12^1^2 +/ 22 ^i ( 1 ) 

he an indefinite form and 

D=D(f) =Ai/22-/L- 

Then 

M(/)=inf|/(«,,«2)|^j^f, (2) 

except when f is equivalent to a multiple of one of the two forms 

fo{x) =xl+x^x^-~ x\, (3) 

f,{x)=x\- 2x\ (4) 

for which M{f) = 1 and | D | = | , 2 respectively. 

That /o and are exceptional is clear, since they both represent 
only non-zero integers for integral M=f=o. The constant in (2) 
cannot, in fact, be improved since the next form of the Markoff chain is 

/2 = 5^? +11 ^1^2— 5x2 

which has D{j^ = — 221/4 and can be shown to have M{f^ =5. 

We now prove Theorem IV. If M[f) = 0 there is nothing to prove. 
Otherwise, we may suppose, without loss of generality, that 

M{f) = 1 , 

by considering tf instead of /, where t is a suitable number. By the 
general argument of § 2.2, there is a form g{x) = g,(®) equivalent to 
f{x) for which 

l^lg(1.0)|<(1-£)-i, 

where e is any given positive number in the range 0<e<1. Put 
±g(1,0) = (1 -??) + 

where 

0'^r} = ric<e< \ . 
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Since the equivalent forms / and g have the same determinant D, we 
may write g{x) in the shape 

±g(a,)=J^i±^-|Z)|(t-,?)4, (5) 

where a = a, is a real number, which may be supposed to satisfy 



0 ^ a ^ -J- 



( 6 ) 



on replacing by ±^i + v:t 2 "'•^h a suitable integer v. Since M(f) =1, 

we have either , , ,, 

K+^a»2) (7) 



or 



■Vs 

(«, + a,«2)^ 

1 - Vs 






( 8 ) 



for each pair of integers Mj , not both 0. Of course as e changes there 
is no reason to suppose that for fixed u the same alternative (7) or (8) 
always holds. 

We consider various suitable pairs of integers and must con- 

sider various cases according as (7) or (8) holds for the integers in 
question. Since we wish to single out the forms (3) and (4). we natu- 
rally choose values of u such that /q(u) = ±1 or /i(u) = ±1. 

In the first place, (7) cannot hold with (mj, u^) =(0, 1) since by (6) 
it would imply |D|<0, at least when rj is small enough. Hence on 
putting (Mi, Mj) = (0. t) in (8), we have 



— rj)^\D\^ {i ~ rj) + (9) 

for all e less than some eo>0. 

We now consider the two possibihties when (Mj, Mj) = (f . t)- Suppose, 
first, that there are arbitrarily small values of s such that (7) holds. 
For these e we have, suppressing the suffix e, that 



(1 I Z)| g — (t —??) -F (1 -fa)^ (10) 

On eliminating |Z)| between (9) and (10) we have 



2x^i-2r] 

and so 

( 11 ) 

by (6). On substituting this in (9) and (10), it follows that |D| can 
differ from J at most by terms of the order of rj. But now lii| is in- 
dependent of Tj and either rj=0 or rj>0 can be made arbitrarily 
small. Hence )£)| . We now revert to one particular g{x) =gg(x) 

for which (10) is true, where now we have the additional information 
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I D| = f . On substituting D =f , oL^^ — rj in (9), we have 

0 . 

Since r]<2, this implies t/ = 0. Hence a=J and 

±g{H. Xi) = (^1 + - f *2 = /o (^1 . X^) ■ 

Otherwise (10) cannot hold, when e is small enough; and so for all e 
less than some £i>0 we have (8) with u = (l, 1), that is 

(1 —??)*! D| ^ (1 -t?) + (1 +a)^ (12) 

We now consider the possibiUties for u = (— 3, 2). Note that 
/i(— 3, 2)=1, where is given by (4). If there are arbitrarily small 
Vcdues of £ such that (7) holds with u = (— 3, 2), then for these s 

4(1 - ??)^ 1 Z)| ^-(1 -»?)+(- 3 +2a)2. (13) 

On eliminating |D| between (12) and (13) we have 4a^»;, so 0^4a^j; 
by (6). On substituting in (12) and (13) and using the fact that r}=0 
or Yj can be made arbitrarily small and positive, we find that |0| =2. 
Finally on putting |Z)| =2, a^O in (12) we get ?;=0, so a=0 and 

±g(») =x\-2x\=ti{x). 

Otherwise for all e less than some £2>0 we must have (8) with u — 
(—3, 2), that is 

4(l-»?)*|D|^(l-»?) +(-3 +2a)^ (14) 

But now the right-hand sides of (12) and (14) increase and decrease 
respectively in O^ag-J-. If we use (14) and if we use (12). 
In either case we obtain |Z)|^2.21 0{rj), so |Z)|^2.21 since |Z)| is 

independent of jy. 

It is at first sight remarkable in these proofs that the inequahties 
obtained show that r]=0. As already mentioned, this is tied up with 
the phenomenon of “isolation” which we shall discuss more fully later. 

II. 4.3. We consider now the “one-sided” problem for indefinite 
binary quadratic forms. In contrast with § 4-2 there is here no set of 
successive minima. Theorem V A, which we now enunciate, is a special 
case of Theorem IX of Chapter XI and is due to Mahler. 

Theorem V. A. Let 

/(®) =/ii^i + 2 / 12 % ^2 4-/224 
be an indefinite quadratic form and 

^ — /n/22 fli- 
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Then there is an integral vector u4=o such that 



0</(u)^2|£»|i. 



( 1 ) 



The sign of equality is required when and only when f is equivalent to a 
multiple of f(x\-xx 



B. For any £>0 there are infinitely many forms, not equivalent to 
multiples of each other, such that 

M^{f)= ini f(u)>(2-s)\D\K (2) 

/ (m) > o 
u integral 

We first prove A. That /q = XjAJj is exceptional is obvious, so we 
need only prove (1) and that equality can occur only when stated. As 
in § 4.2, we may suppose that 

where M^{f) is defined by (2). Hence, as in § 4.2, there is a form 
equivalent to /, where 

0 ^ ^ 



and rj'^0 can be made arbitrarily small *. Suppose, first, that 
?(— 1> 1)^ 1. Then 

(1 —rj)^\D \ ^ (1 — a)® — (1 —r])^rj. 



which is impossible if rj is small enough, since is independent of tj. 
Hence g(— 1, 1)^0, that is 



the sign of equahty being required only when a.=j,rj=0', that is when 
g(x) = (%i + iXj)* - Ixl = + Xj) = /o(Xi, Xj + Xj) . 



It remains to prove B. It will be shown in § 4.4 that the forms 
/t(x) =^(x?+XjX2) -xl 

have 

M+(f,)=k 



when ^ is a positive integer. Since 
|Z)(/*)| + 

• More precisely, we should work with a family of forms gg(x) as in § II 4.2. 
Having once carried out this type of proof in full rigour, in the rest of this chapter 
we shall be more informal. 
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the ratio 



M+mD[f,)Y 



may be arbitrarily close to 2. 

Another simple proof of B would be by means of continued fractions. 



II. 4.4. As an interpolation between the problems of § 4.2 and 4.3 
one may consider the forms f{x) such that there is no integral point 
M=bo in 

— a < /(u) < b, 



where a and h are given positive numbers. 

For some values of a and b one may deduce the least possible value 
of D{f) from the results of §4.2. For example^ if 



a = 1 , 



11 



we certainly have 



and so by Theorem IV either 



or / is equivalent to 



M(/) ^ 1 , 
\D{i)\^2 



t {x\ + a ;2 — x\) 



for some t. In the second case it is clearly enough that The 

corresponding determinant is • -^ < 2. Hence we have an isolated 

first minimum. Note that the form with the least |D| does not take 
any values in the neighbourhood of —a. 

For any given values of a and b the techniques of §§ 4.2, 4.3 some- 
times apply. For example, the minimum determinant when a = 5, =3 
is |Z)| =24 given by ^xl — Sxl] this being isolated. The verification of 
this statement is left to the reader. Here we shall prove only the follow- 
ing theorem due essentially to Segre (1945a). 

Theorem VI. Let 

/(®) =/ii^? + 2/12^1^2 + /a 2 ^2 (I) 

have determinant 

( 2 ) 



Suppose that there is no integral u =)= o such that 

-a<f(u)<b, (3) 

where a>0, b>0. T hen 

\D\'^ ab + j max (a^, b^) . (4) 



' This remark was made to the author by Professor C. A. Rogers. 
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If b>a, the sign of equality is required when and only when 

k=bfa (5) 

is an integer and 

/(*)=«/*(*). ( 6 ) 

where 

fk{x) =k[x\+ x^x^) - x\. (7) 

For ^ = 1, Theorem VI is contained in Theorem IV. When k is not 
an integer, an explicit improvement of (4) can be given. When k is an 
integer, there is isolation and much more is in fact known [Sawyer 
Tornheim (1955a)]. When b^a the cases of equality may, 
of course, be deduced from the theorem by interchanging a and b. 

We may suppose without loss of generahty that 
a —i , b = k, 



where at first k is not necessarily an integer. Let 



so that 



c=M+{f) = inf /(m), 

/( M )>0 

c^k. 



As in § 4-2 there is a form g{x) equivalent to f{x) of the shape 
where 

0 ^ a ^ j 



and rj^Q may be chosen arbitrarily small. 

Clearly g(0, 1)<c, so g(0, 1)^ — 1. Hence g(1, — l)<c, and so 
g(f . —1)^ — 1, that is 



Hence 



' ' f — (1 — '■ ^ 

\D\^c+ic^^k+lk^ 



with equahty only when 

t]=0, a = ^, c=k, 



so 



g{x) =fi,{x). 



It remains to see whether ^(ac) has any integral solutions u 4=0 of 
-i<f^(u)<k. (8) 
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Since /^(l, 0) >0 but — oo as oo, there must be some 

integer such that 

If (8) were insoluble, we should have 

+ 1)^ -1: 

that is 

v{k — v)'^0, (v +2){k — v)^0. 

This is possible only when v—k, i.e. when k is an integer. 

It remains only to show that when k is an integer there is no integral 

u=bo such that —\<f^{u)<k. Since the roots of f^{d,i)=0 are 

irrational, it is impossible that ^ (u) = 0. Hence we must deduce a 

contradiction from , , , , 

0<f,{u)<k. (9) 

If there are several solutions of (9) we choose one for which the integer 
I «i I is as small as possible. Clearly 

«i 0. 

We require the identities 

/*(*) = /* {(A + f) ^1 - Xi, -kx^ + x^} 

= AK + ^2. kxi + (k -\-\)x^) 

= {k + 2) x^{(k + 1) - Xi) - {(A +\)xi- Xif - x\ 

= {k + 2) %(% + x^) — (%i + x^Y — x\. 

Since /*(u)>0, the last of these identities shows that 

+1)mi — «2}>0 
+ >0. 

On writing — u for « if necessary, we thus have 

«1>0, (A +1)Mi>« 2> — Ml- (10) 

From the first two identities and the minimal property of | | , we have 

I Ml + Mai ^ Ml. 

|(A +1)mi-«21 

and so, by (10), 

O^Mj^Ami, 0<Mj. 

But then 

f^{u) =ku\ + U^{kUi — Mj) S ku\ ^ k. 

Hence our assumption (9) was false. 
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By considering f[\,v) for all integers v, the estimate (4) may be 
improved when k is not an integer. Since 4 is the only form / satisfying 
/(1, 0) =k and 

/(O, f(i,k 

/(— 1,1)^— f, f(^,k)^k, 

the form /* gives an isolated first minimum when k is an integer. The 
proof of these statements is left to the reader (cf. papers quoted at the 
beginning of § 4-4). 

II. 4. 5. We now consider indefinite ternary forms. As already noted 
(§4.1) there is a set of successive minima, the first eleven having been 
found by Venkov (1945a). There is a derivation of the first four 
minima due to Oppenheim in Dickson (1930a) and a neat proof of the 
first minimum only by Davenport (1947a). Here we shall prove only 
the following result. 

Theorem VII. Let 

(1) 

be an indefinite ternary quadratic form with determinant 

D(/) =det (/,•,.) =#0. (2) 

Then 

M{f)^mi\f{u)\^\iD\i, (3) 

U #:0 

integral 

except when f is equivalent to a multiple of 

/o = ^i +%^2 — ^1 — ^2^3 + ^^ (4) 

Further, 

M{h)=\, D{f,)=^. (5) 

We first prove (5). Since /o(m) is an integer when u 4=0 is integral, 
it is enough to show that /p(u) =|=0. Now 

4/o(“) = (2«i + Ma)^ + ( 2«3 — u^^ — 6u\. 

Hence it is enough to show that there are no integral solutions of 

v\^v\ = 6v\ 

other than v^ = v^ = v^=0. We may suppose that v^,v^,v^ have no 
common factor. Then clearly v^ and must be divisible by 3- Then 
v\ + must be divisible by 9, so is divisible by 3 ; a contradiction. 

That the constant | in (3) cannot be further improved is shown by 
fi{x) =xl+x^X 2 — xl — 2x1. 
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The reader should have no difficulty in modifying the proof to show 
that this is the only case when there is equality in (3) and that it is 
isolated. 

We may suppose as before that 

M(/)=1 (6) 

and, by taking — / for / if necessary, that 

D< 0 . (7) 

We have to show that / is equivalent to /„ or Z)^ — It is convenient 

to enunciate steps of the proof as propositions. 

Proposition 1. Either 

f{u)=\ (8) 

/(«)><> 

or 

(9) 

If (6) is true but (8) is false, there must be integral u such that 
/(u) = — (1 — ?^)'^, where may be chosen arbitrarily small. Hence 

f{x) is equivalent to a form g(x) of the shape 

(1 -r])g{x) = - K + aT2 +/9%3)2 + A(t2,T3), 

where a, are real numbers and the form 

h[x) = A22 ^2 T 2^23 %2 ^3 ^33 ^3 

must be positive definite. The determinant of h{x) is 

^22^33-^13= - (1 -VVD =(1 -Tjy\D\. 

After a transformation on the variables x^, x^, we may suppose that 
h{x) is reduced; and so 

( 10 ) 

by Theorem II. 

We now consider the indefinite binary form 

G{x^,x^ =(1 —r])g{x^,Xi, 0 ) = - {xi+a.XiY + 
of determinant —h^^. Clearly 

M{G) ^ (1 — 7]) M{g) 

Hence, by Theorem IV, either 
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or G(xj, Xj) is equivalent to one of x^x^—xX) or t{x\ — 2x\) for 

some number t with j;). If the second alternative holds, we 

must have t = — \, since G(1,0) = — 1. Then there are integral 
such that G(u) = + 1, i.e. g[u^, 0) =(1 — so 

since rj^O may be chosen arbitrarily small. Otherwise the first alter- 
native, namely (11), holds, and so, by (10), 




This proves the proposition. 

We may now suppose that 

M+(/)=1. (12) 

As before, there is a form g equivalent to / such that 

(1 -rj)g{x) = {x^ + (x.X 2 +^X 3 Y + h{x.^,X 3 ), 
where may be chosen arbitrarily small, and the form 

h[X2, X^ ~ ^22 ^2 ”b 2^23 ^2 ^3 T ^33 ^3 (f 3) 

is now indefinite and has determinant 

hl3 = [\ -fj^D <0. ( 14 ) 

Proposition 2. If u^, are integers not both 0, then either 

^(«2.“3)^|->?. 05) 

or 

h{u2.U3)-£—2+rj, (16) 

or 

-f-4?^M«2.«3)^ -f +!?• (17) 

Further, if (17) holds there is an integer v such that 

\v +~l — (a.U2+Pu2)\^^rj. (18) 

We must first show that there are no integral solutions m 4=0 of 

— 2 +rj<h{u2,U3) < — z — rj, 

- z +rj <h(u 2 ,U 2 )^ -1+7?, 

-1 +Tj<h{U2,U3)<^-rj. 

We may clearly choose the integer u^ so that respectively 
1^ l«i + a«2 +?5 m 3| g |, 

|mi + a«2 +/3«3| ^ 1 , 
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0glMi + a«2+/3M3|^|. 

Then in each case we have 

(l-»;)lg(u)l<i -jy, 

contrary to hypothesis. 

Suppose that (17) holds. There is an integer t and a real number 
T such that by choice of sign 

a«2 +/S«3 = f ± T, 

We may clearly choose integers , «(' so that 

\u[ +a«2+/?M3| =1 - T 
I Ml + a M2 + ^ M3 1 = 1 + T . 

Then 

+ g(Ml,M2.«3)^0^g(Mi',M2,M3) + t] , 

and so 

h(Ui,U 3 ) + (1 - T)2 = g(M(,M2,M3) ^ -1 +r], (19) 

h{u^,u^ + (1 +T)2 = g(M(',M2,M3)^l -7]. (20) 

By subtracting (19) from (20) we have 

This is equivalent to (18) and so proves the proposition. 

Corollary. If (17) holds, then and M3 cannot have a common 
factor except d;l- 

For if Ui=vu 2 , U 3 =vu 3 , where v>i, none of (15), (I6) or (17) would 



be satisfied by h(u 2 , M3). 

Proposition 3. Either 

( 21 ) 

or, after an equivalence transformation, we may suppose that 

-l-n^h{\,0)^-l+ri, ( 22 ) 

h{\,-\)^l-r], (23) 

-l-n^h{\,\)<-l+ri, (24) 

h{2,-\)^-2+r], (25) 

( 26 ) 

(27) 

provided that rj is less than some absolute constant r}(,>0. 
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Suppose, first, that there are no solutions of 
-2 +rj<h(u^,U 3 )<l-rj. 

Then, by Segre’s Theorem VI, we must have 

\K,h 3 -hl 3 \^i{ 2 -rir + ( 2 ~ri)(i-ri)=i{ 2 -v){i-v)- 

Hence, by (14), 

|Z)|> >A. 

' ' 4 (1 — ~ 2 

Otherwise by Proposition 2 there is a solution of | A(«2, M3) 4-f | and 
by Proposition 2, Corollary we may suppose, after a suitable trans- 
formation on 2^2 > ^3 ■ that 

-|-^^A(1,0) =/f 23 g~|+»y. (28) 

After a further substitution of the type a; 2-> where v is an 

integer, we may suppose further that 

0^2A23^''*22 ^ -!->?• (29) 

We now consider /((mj, M3) for various choices of Mj, Mj. If h{ 0 , 1) 
^ — f+j?; that is ^>33^ — f +??, we should have 

^2 2 ^3 3 ^2 3 > 0 , 

contrary to the assumption that h is an indefinite form. Hence 
A33 > — z + f]’ ^nd so, by Proposition 2, 

/f33 = /f(0, 1 )S|- 7 ?. 

But now, by (29), 

h ~ ^ h33> — f-f-??, 

and so, by Proposition 2 again. 



^22 2^23 + A33 — ^(1, 1)^j~J?- (30) 

Hence 

^f(l, 1) = A(1, — 1) -f 4A23> — 2 

by (29). 

We now consider the two remaining possibilities for A (1 , 1) allowed by 
Proposition 2. Suppose, first, that 

h{\, 1) = ^22 -f 2^23 + ^33 ~V > 
so 

^33 = T V ^22 = 2 — 2 t] . 

Then, by (14), 

(1 -rj)^\D\ =/fi3- .^22^33^ 

Cassels, Geometry of Numbers 
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SO 



mi > A > A 

' ' ^ (1 2 — 2 ’ 



which is all we require. We may therefore suppose that 

— 4— =^22 + 2^23 + ^33^ ~f+^- 

We now invoke the part of Proposition 2 referring to a and ^ with 
(M 2 , M 3 ) =(1,0) and (1, 1). Hence there are integers v' and v" such that 






\v 

\v" + {a.+P)\^irj. 



After a substitution of x^-\-v' x^-\-{v" —v') for x^ we may suppose 
indeed that , , 






\rj. 



li — (IX +^)| 

Then , , 

We now consider 

A ( 2 , — 1 ) =h{\,\) + 3^22 — 6^23^ A ( 1 , 1 ) ^ — f +»?. 

We cannot have h{2, — 1)^— f — >?, since then by Proposition 2 the 
fractional part of 2a.— ^ would be about -J-, while we know that 2a.— ^ 
is 1 + 0(?^). Hence 

4 A 22 4^23 "b A 33 = h ( 2 , 1 )— 2 -b^‘ 

This completes the proof of the assertions of Proposition 3- 
We now conclude the proof of the theorem. The inequalities (22) 
to ( 25 ) of Proposition 3 are linear inequalities in A 22 , A 23 , A 33 . Put 



Then (22) to (25) become 



A22 — — f + . 

Ki = - - 2 - +M> 
A33 == f +vr]. 



Hence 



2v — (k 



U!^i, 

A — 2|«+i'^— 1 , 

|A + 2^+v|^1, 

4A — 4/< + r ^ 1 . 

2/^ + 1’) + (A + 2/i + v) 



( 31 ) 

( 32 ) 

(33) 

(34) 

(35) 

(36) 

(37) 



so 



2AS -4, 

3v = 4A — 4/2 "l~v 4~2(A -f-2/2 ■Tv) — 6A ^9, 

lH^3- 
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|y«1^3. 



Hence 

by (36). Hence and by (14b 

(1 ~rjf\D\ = hli—h^ih^s = l + (— + 0 {t]^). (38) 

= i + 0{ri) 

But D is independent of 7], so* 

Pl=l- 

Suppose, if possible, that rj^O. On putting \D \ =f in (38) we have 
-A-/X + |v = -i+0{rj). 

For small enough rj this contradicts (34), (35) and (36), since they give 
— A— = — fA+|-(A — 2 //+v) + f(A + 2,«+v) 



= 4 8 8 



Hence rj=0, so by (13), (26), (27), (3I), (32), (33). we have 

g(x) = (Xi + -l-X^)^ - ^xl -X^X3 + xl= /„(x) . 

Since g (a?l is equivalent to / (a;), this concludes the proof of Theorem VII. 

II. 5. Binary cubic forms. We must first consider briefly the 
algebra associated with a binary cubic form 



/ (^1 . ^2) = d x\ b x\ X3 ^ c Xy x\ -\- d x\. 



(1) 



Such a form may always be split up into linear factors with real or 
complex coefficients: 

/ (^ 1 . * 2 ) = 77 (■&i Xr + rpj X 3 ) . ( 2 ) 

With the form is associated the discriminant 



D{f)= n 0) 

lii<kS:3 

It is easily verified that 

D{f) = \8abcd + b^c^ — 4«c® — 4db^ — 27 a^d^ (4) 

(see § 5.2). From (3) it follows that D(f) =0 if and only if /(%j, Xj) 
has a repeated linear factor. Forms / with D{f) —0 are called singular. 

The discriminant D(f) is an invariant of the cubic, in the sense 
that if 

=f(xx3+^x3,yxi + dx3) (5) 

* More precisely, we should have worked with a family of forms ^^(x) as in 
§ II 4.2, each form with its own rj = rj^ and Q-^t] < e. Then X, fj,, v depend on e, 
but ( 38 ) is true for all sufficiently small e. 



4 ^ 
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identically for some numbers a, y, d, then 

D{f) = (oid-^yYD(i), (6) 

as follows at once from (3) and the fact that 

/'(% .Xi) = n Xi + y)'j x^) . ( 7 ) 

i 

where 

■&'j = at?,- + yfj, f'j = / 3 t?, + Sfj. (8) 

In particular, Z)(/') =D{f) if / and /' are equivalent, since then ( 5 ) holds 
for some integers a, /S, y, (5 with a.d—^y = ±\. 

If a, b, c, d are real, then either all the ratios are real or two 
of them are conjugate complex and the third is real, since roots | of 
an equation /(|, 1) =0 with real coefficients occur in complex conjugate 
pairs. This subdivides the real non-singular binary cubic forms into 
two essentially distinct types. We show now that two forms in the 
same type may be transformed into each other by a transformation of 
the type ( 7 ) with real a, y, d. It is enough to show that all forms / 
of a given type may be transformed into an /' which is fixed for the type. 
We may suppose without loss of generahty that either 

are all real (9i) 

or 

&3,y>3 are real, and — (92) 

in our two respective cases, where the bar denotes the complex conjugate. 
Clearly these two cases are characterised by Z) > 0 and £> < 0 respectively. 
There exist numbers Aj, A3 not all 0 such that 

hWi + hn + = 0 . 

Ai ??1 -|- A2l?2 + As^3 = 0 .) 

If, say, A3 = 0, we should have 

^1% — = 0< 

and so T>(/) =0 by (3), contrary to the hypothesis that / is non-singular. 
Hence AiA2A3=)=0 and we may suppose, without loss of generality, by 
multiplying Ai , A2 , A3 by a common factor, that 

Ai A2A3 = t . (11) 

We now distinguish the two cases according as ( 9 i) or (92) holds. If 
(9i) holds, we may suppose that Aj , Aj , A3 are real and put 

W,- = - A,- (dj -f y;,- X2) (/ = 1 , 2) . 
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Then 

and so, by (11), 



+ fz^i) = ^1 + ^ 2 . 



If (9a) holds, we may suppose that Aj=Ix, ^=^3 and put 



( 12 ) 



QXi + Q^X^ = Xi{&iXi+fiXi) ] 
Q^^i + qX2 = + y>2 X 2 ) , 1 


(13) 


where p is a complex cube root of 1. Then, by (10), 




•^1 T ■^2 ~ ^ (^3 T Ws ^ 2 ) 


(14) 


and 




f(x„X2)=Xl + Xl 


(15) 



The coefficients a, /3, y, 6 in 

X^ = d.Xi-\-^X2, X^=yXi-\-dx^ 

are real, since the two equations (I3) here are complex conjugates one 
of the other. 

In the sense of § 4 of Chapter I the values taken by non-singular 
binary cubic forms are the values taken by the function 

cp(X)=X^X^{X, + X,) 
or 



at the points of a lattice. The reader will have no difficulty in verifying 
that there is a corresponding result for singular cubic forms, with 

q>{X)=X\X„ 

<p{X)=Xl 



according as only two or all three of the linear forms &jXi + y>jX2 are 
multiples of each other. 

It was first shown by Mordell (1943 b) that if / is a real cubic form, 
then there is an integer vector u=po such that 



|/(«)|^ 



D i 

■ 

D i 

23 ’ 
e. 



( 16 ) 



according as D > 0, Z) < 0 or D = 0, where e is an arbitrarily small positive 
number. The third case, when f{x) is singular, may be dealt with 
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trivially by Minkowski’s theorem of the next chapter, so we do not 
discuss it here. That the coefficients 49, 2} are best possible in their 
respective cases is shown by the binary cubic forms 



x\ — 2 Xl x\ — x\ 


(17) 


x\ — Xj . 


(18) 



These have discriminants 49 and 23 respectively. Since they do not 
represent 0 and represent integer values for integer vectors u, the ^ in 
(16) cannot be replaced by <. It will be shown that < may be taken 
in (16) for all forms not equivalent to (17) and (18). 

The results (16) were not first obtained by reduction arguments. 
Davenport (1945 a, b) has however given simple proofs by such 
arguments. 

This treatment consists in defining a binary cubic form as being 
reduced if a certain definite quadratic form associated with it is reduced : 
it is necessary to choose different quadratic forms according as D>0 
or D<0. Davenport then shows for a reduced form that either (16) 
is true with strict inequality for one of a prescribed set of u, or f{x) 
is one of the forms (17), (18). We give the proof for D>0 in full but 
only sketch that for D<0 since we shall later be using the case D<0 
to illustrate another technique. 

It was shown by Davenport (1941b) that neither the 49 nor the 
23 is isolated. We do not give the proof, which depends essentially on 
the fact that although a cubic form f{x) is always indefinite the area 
of the region 

l/(x)l <1 

is finite, and the forms (17), (I 8 ) take the values ± 1 only a finite number 
of times; in contrast to the situation with indefinite quadratic forms. 

II. 5.2. In order to enunciate Davenport's result we must first 
introduce a quadratic form associated with a cubic form 



/(%, X 2 ) — a xl + b xl X 2 c Xj^xl d xl 



namely the hessian 






h(xi.x^) =^|(- 



O^ t 

dxydx^ ] 



gy ay I 

dx\ dx\ I 



= A x\ + B x^x^ + C x\, 



(1) 

( 2 ) 



where 



A =b^ — ^ac, 



B=bc — 9<id, C=c^ — }bd. 



(3) 

(4) 

(5) 
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On evaluating the partial differentials by (2), a brief calculation shows 
that 



h (Xi .X^) =z (^1 ( 6 ) 



the sum being taken over all cyclic permutations of 1, 2, 3. 

We now show that the hessian is a covariant of the form f(x^, x^); 
that is if a, p, y, 6 are real numbers with 

a6-/?y = ±1, (7) 

then the hessian of the form x^) defined by 

f'(xi. X2) =i[a.x^+px^, + dx^) 

is 

h'{x^,x^ =h{aXi+^X2, yx^ + dx^). 

Indeed this follows at once from (6) and the expressions (7), (8) of § 5-1, 
on noting that 

&iV>k - = (a <5 - /5y) - ^kfj) = ± {&jWk ~ hf,) . 

on using (7)- 

From either (5) or (6) we see that the determinant of h{x^, x^ is 

AC-\B^ = lD{i). (8) 

In particular, h{xy, x^) is definite when and only when D>0, i.e. when 
/ is a product of three real linear forms [when the , y>j are real the form 

(6) is clearly positive definite, but the converse is not so clear without 

using (8)]. 

When the are real, the form / was said by Hermite to be 
reduced when the definite quadratic form h is reduced in the sense of 
MiNKOWSKih 

Every form with real is equivalent to a reduced form. For 
the transformation which reduces the h{x) in Minkowski’s sense also 
reduces f{x ) ; since h{x) is a covariant of f{x), as we have seen. Further, 
this reduction can be carried out in only a finite number of ways since 
we saw that a definite quadratic form can be reduced by only a finite 
number of transformations. 

II. 5.3. We may now enunciate and prove Davenport’s theorem: 

Theorem VIII. Let f[x) he a binary cubic form with discriminant 
D>0 which is reduced in the sense of Hermite (§ 5.2). Then 

min{|/(l,0)|, |/(0,1)|, |/(t,l)l, 1 /( 1 ,- 1 ) 1 }^ ( 1 ) 



1 He could not put it this way, of course! 
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The sign of equality is needed only when 



— ( 2 ) 

or 

±f{xi,±x^)=x\ + 2x\x^-x-^x\-)\, ( 3 ) 

the ± signs being independent. 

Davenport actually proved that if / [x^ , x^ is reduced, then at least 
one of the five products 

i/(l,0)/(0,1)|, |/(i,0)/(t,l)|, |/(0,t)/f1,1)|, 

i/(l,0)/(l,-t)|, 1/(0, 1)/(t, -1)1 

is ^(D/49)^ with equality only for the forms (2) and (3), as before. 
We shall follow Chalk (1949) and prove another generalisation. Let 

h [x^i ,x^ = A x\ B x^x^ + C x\ 

be the hessian of f{x), so that 

0-^B^A^C, A>0 (4) 

Chalk’s result is that 

min{|/(1,0)|, |/(0,1)|, 1/(1, 1)1, 1/(1,- 1)1}^ 

with equality only for the forms (2) and (3). Since AAC — B^^f)A^, 
and AAC — B^ = '^D(f) by (8) of § 5-2, this will be a stronger result 
than Theorem VIII. 

We may suppose by homogeneity that 

A = 7. (5) 

We must then deduce a contradiction from 

1/(1, 0)1^1, |/(0,1)|^1, 1/(1, 1)1^1, 1/(1, -1)1^1, 

except for the forms (2) and (3). On writing 

f{x) = ax\ -\-b x\x^-\- c x^x\ d 
these inequalities are 

|a|^l, \d\^\, (6) 

|a+6+c + <^|^l, |a — 6+c — (i|^1. (7) 

By taking — / for / we may suppose that 



a^l. 



( 8 ) 
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We shall require the identities 

A=b^ — }ac, B = hc — <)ad, C=c^ — '}bd, (9) 

from which follow 

Be — Cb = }Ad, Bb — Ac = '^Ca. (10) 

From (4), (5) and (9) we have 

0^bc-9ad-^7. ( 11 ) 

Suppose, if possible, that ^>0. Then (11) gives 

bc^gad^g. ( 12 ) 



If 6^00 there is a contradiction with (lOi) and if c^b>0 there is 
a contradiction with (lOj), so 

b<0, c<0. 

Then we should have 

A =b^ — } a c 

= lJp + f|ac| +||«c| 

^3(|a2 62c2)4 

by the inequality of the arithmetic and geometric means; and so, by 

( 12 ). 

in contradiction with the normalization A —7. 

Hence we may suppose that 

d<0. 

and so, by (11), 

bc^7 -ga\d\^ - 2. (13) 

If b<0<c we have a contradiction with (IO2), so 

c<0<6, 

and (13) becomes 

fije] ^9fl|<^| - 7^2. (14) 

Further, (5) becomes 

7=^ =J2 + 3a|c|^J* + 3|cl. (15) 

On substituting (14) in (15), we have 

7^b^ + }\c\^b^ + 6lb, 

and so 

\^b^2. (16) 




58 



Reduction 



Similarly we have 



and so 



7^^ + }\c\. 
1 ^ - c ^ 2. 



Clearly a sign of equality can hold in (16) or (17) only if 
a — — d = i , be = — 2. 



(17) 

( 18 ) 



From (14), (16) and (17) we now have 



and so 



9a\d \ ^ 7 + |6c| ^ 11 
\d\^ — . 

— Q II — Q 



But now 

a~b-\-c — d^^ — \ — \ +-^ < f , 

and so 

a — b+c — d-^—\. (19) 

We now consider the two possibilities for /(I, 1). If 

Cl -\- b c d — 1, (20) 

then on adding (19) and (20) we have 

a — |c|^— 1, so |c|^l+a^2. 

Comparison with (17) shows that |c| =2, and, since there is equality 
in (17), we must have (18); that is 



Similarly, if 
then 

and we have 



a = —d=\, b=\, C — — 2 . 

u b c d -{“ 1 , 
b + d^ \ , so b'^2] 
a — — d = \ , b =2, c = — 1. 



This concludes the proof of the theorem. 

II. 5.4. When the binary cubic form / has discriminant D(/)<0 the 
hessian form is indefinite, and so a reduction of the hessian does not 
single out a finite number of reduced forms from amongst the forms 
equivalent to /. However, if Z) < 0 then only one of the linear factors 
of / is real, and / may be put in the shape 

= (i?3^i +^^ 2 ) [P^l + + 



( 1 ) 




Binary cubic forms 



59 



where the form Px\-\-Qx^X 2 + Rx\ is positive definite, since it is the 
product of two conjugate forms with complex coefficients. Davenport 
following earlier workers calls such a form reduced if the quadratic 
form 

Px\ + Qx^x^ + Rx\ 



is MiNKOWSKi-reduced, that is 



and, further. 



^3V*3 = 0 . 



( 2 ) 

( 3 ) 



The last condition may be achieved by changing the sign of if need 
be, which does not affect (2). Davenport (1945b) proves 



Theorem IX. If f [x] is Unary cubic form with discriminant D (/) < 0, 
then there are integers u=)=o such that 






IL * 

23 



If, further, f{x) is reduced, then 

min[|/(1,0)|, i/(0,t)|, |/(1,-1)|, l/(1,-2)|]g 
with equality only when 

fixi.Xi) = a{xl + xlx^ + 2x^x1 + xl) . 



— I* 

23 ’ 



We only sketch the proof and refer to the original memoire for the 
details. We later give another proof of the first paragraph of the 
theorem (Chapter III, Theorem VII). 

We have to show that D{f)< — 23 when 

1 /( 1 , 0 ) 1 ^ 1 , |/( 0 , 1 )|^ 1 , 

1 /( 1 , - 1 ) 1 ^ 1 , 1 /( 1 , - 2 ) 1 ^ 1 , 

i.e. when 

P|^3|^l, (4,) 

|^3-%|(^-^+-R)^1, (4^) 

|^3-2v3l(P-2^+4P)Sl, ( 43 ) 



since P-Q-\-R, P — 2Q + 4R are positive by the positive definiteness 
of the quadratic form. For fixed and y)^, the inequalities (2) and (4) 
restrict the point P, Q, R in 3-dimensional euclidean space to lie in a 
certain infinite region bounded by planes. Davenport shows, 
further, that 



D(f)={Pfl-Q^,f, + Rfl}^(4PR-Q^), 
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and that |Z)(/)|^ is a convex function of (P,Q,R) for fixed ^ 3 ,^ 3 . 
Hence the maximum of D{f) is attained at the vertices of y, where 
three of the plane faces meet [since it is easily seen that | D | ^ 00 as 
max(|P|, I ^|, oo]- The proof then follows from a rather tricky 

estimation of D(J) at the vertices of 6 ^. 

II. 6. Other forms. We briefly survey here results on the reduction 
of forms other than those already discussed. 

II. 6.2. For binary forms of degree there is more than one 
invariant. For example, a binary quartic form f(x^, x^) which is the 
product of two pairs of complex conjugate hnear forms may be reduced 
to the shape 

(p{X) = f(X^. X3) = X\ + GfiXlXl + XI 

where 

Wi = axi+/3x2, X 2 = yx^ + 6 x 3 , 
for some real <x,p,y, d and /i=fx{f) is a real number lying in 

<i- 

Two forms with different fi cannot be transformed into each other by 
a homogeneous linear transformation of the variables. Further, /^(/) is 
an absolute invariant in the sense that —n{f), where t is any num- 
ber. Of course we still also have the discriminant 

Dii)= n 

where 

fiXi.Xi) =17 

i 

The problem for definite binary quartics was solved independently 
by Davis (1951 a) and Cern^ (1952 a) in the sense that they found the 
best possible function y (/i) of fi such that every form / with invariant fi 
has 

inif(u)^y(f^){D{f)}K 

U^O 

integral 

Davis (1951a) also gives some results for indefinite binary quartic 
and full references to earlier work. It is no longer true, as it was for 
quadratic and cubic forms, that forms / with D (/) = 0 assume arbitrarily 
small values. This case was completely elucidated by Davenport 
(1950a). 

The methods of these authors combines reduction techniques with 
other tools drawn from the geometry of numbers. 

There does not seem to be any systematic work on binary forms of 
degree greater than 4 . 
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II. 6.3. The only other types of forms /(^i, .... of degree n with 
m>2, n>2 for which the best estimate of 



M{f) = inf l/(u)| 

«I4:0 

integral 

is known appear to be the ternary cubic forms with real coefficients 
which are expressible as the product of three real hnear forms: 

/ ,X^,X^) = U (i?,! l?,-3 X3) , 

where either all the are real (first type) or ^32i ^33 are real and 
There is an invariant 

Z)(/) = {det(l?,.*)|^ 

This is the only invariant in each type, since there are obvious real 
transformations taking / into 

and 

respectively. The two types are distinguished by Z)>0 and D<0 
respectively. The following two results are known: 

Theorem X. Let f{Xi, x^, x^) be a factorisable ternary cubic form with 
D{f)>0. Then there exist integers u^=o such that 

except when f is equivalent to a multiple of one of the forms 

fi 9 = x\+ x\+ xl—xlx^+’i xlx^~2Xix\-\-6xixl—2x^ x\—xl x^—x^ x^ x^, 

fsi = x\+x\->rxl+6x\x^—'^x^xl->r9x^x\-f,x^xl-'^xix^x^, 



for which M{f)—\ and Z)(/)=49, 81 respectively. 

Theorem XI. Let f{x) be a factorisable ternary cubic form with 
D(f)<0. Then there exist integers u=|=o such that 



|/H^ 



23 ■ 



The sign of equality is needed when and only when f{x) is equivalent to 
a multiple of the form 

f9z = x\+x\-\-x\+2x\x9- x^xl+x^xl-x^xl-'^x^x^x^. 



We note that fi^, f si and fss are all of the shape 



Norm [Xi -\- <p Xs -{■ f x^) , 
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where \,(p,rp are a basis for the integers of a cubic field. We shall 
discuss later the reasons why this might have been expected (Chapter X). 
For fti.fgi and /23 we have y> = 93*, and tp satisfies the respective equations : 

9’® + ?’^— 293 — 1 = 0, 

93® — 393 — f = 0 . 

and 

~ (p — \ = 0 . 

[By Norm is meant the product of the three forms obtained from the 
given one by inserting the three pairs of conjugate values for 99 and tp.] 
The first equation here corresponds in an obvious way to the form in 
Theorem VIII. The third equation here corresponds to the binary form 

xl — — xl 

which is equivalent to that in Theorem IX on making the substitution 

^ Xl , X2 ^ Xl X2 • 

For D>0 Theorem X gives the first two successive minima and 
shows that the second minimum is isolated. The first minimum in 
Theorem XI is not isolated ; but there is a weaker sense in which it is 
isolated [Davenport and Rogers (1950a, especially Theorem 14): see 
also Chapter X]. Theorem X was obtained by Davenport (1943a). He 
had already obtained the first minimum [Davenport (1938 a) and a 
simpler proof in Davenport (1941a)]. A slightly weaker form of 
Theorem XI in which |D/23P + e with arbitrarily small e>0 appears 
instead of | D/23 was given by Davenport (1943 a) ; the full form is in 
Davenport and Rogers (1950a). Chalk and Rogers (1951a) showed 
that every factorisable ternary cubic form with D>0 is either equi- 
valent to a multiple of / or to a form g(x) with 

|g(l,0,0)g(0,l,0)g(0,0,l)l^(|[)'. 

This is analogue of the results about the products of the diagonal terms 
of definite quadratic forms obtained in § 3. 

We do not prove Theorems X and XI here, since in Chapter X, 
following Mordell, we deduce Theorems X, XI from the corresponding 
results for binary cubics (in which, as the reader wiU have noticed, the 
integers 49 and 23 also occur). It is however worth sketching the 
reduction which Davenport use to prove Theorem X : 

Let /(x) be a factorisable ternary cubic with > 0, where we may suppose, 
without loss of generality for our purpose that M(f) — \. Hence / is equivalent 
to a form g such that 



g(1,0, 0) = (l-7?)-‘. 
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where rj^O is arbitrarily small. Hence we may write 

3 

(1 -Tj)g(x) = n K + + ft-^3)- 

f=i 

We consider also the quadratic form 



A (*) = i; (^1 + «/ 3^2 + ft- Xsf- 
i 

From the inequality of the arithmetic and geometric means h(u) ^ 3(1 — for 
all integers uq= o, and it is easy to verify that in fact h (u) ^ 3, with equality only 
when u = {1, 0, 0). Hence h(x) may be reduced in the sense of Minkowski by 
a transformation of the type 

% “t“ I'l 2 ^2 ^13 

ITg— ^ 1122 ^ 2 - 61123 '^3 

■^3 ^32 ^2 -6 H33 3I3 

where the v^j are integers and i'22*'33— t'23t'3 2= ± f- Since h(x) has determinant 
(1 —ri)^D{f) and is reduced, we have bounds for the coefficients. The proof now 
continues by an intricate and delicate chain of computations using these bounds 
and the fact that |g(u)|^1 for all integers u=t=o. 

Davenport’s treatment of Theorem XI starts off with a similar reduction but 
the completion of the proof requires different ideas and the detailed consideration 
of an intractable 2-dimensional figure. 

II. 6.4. The corresponding problem for the product of ti>3 homo- 
geneous forms in n variables has been much -worked on. Estimates 
but no precise results are known, and these estimates were obtained 
by other methods. We shall consider the case of large n in Chapter IX, 
§8. The best estimates for w=4, 5 in print appear to be those of 
2 ilinskas (1941 a) and Godwin (1950a) respectively; but Godwin refers 
to a better estimate for «=4, presumably the Vienna dissertation of 
G. Bohm (1942) also mentioned in Keller’s encyclopedia article 
[Keller (1954a)] but unavailable to me. 

There is however a striking result of Chalk on the product of .the 
values taken by n linear forms when these values are positive. He 
shows that ii L^, ■■■, L„ are n linear forms in n variables x = (xi, x„) 
with determinant 4l =t= 0, then there exist integers u =4= o such that 

Lj{u)>0 (1^/S«), (1) 

nL,{u)^\A\. (2) 

i 

That the imphed constant 1 on the right-hand side of (2) is the best 
possible is shown by the simple example Lj = Xj. Chalk’s theorem is 
indeed more general than the form given here since it refers to the 
product of inhomogeneous linear forms. Consequently we do not prove 
it here, but later in Chapter XI, § 4. 
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Chapter III 

Theorems of Blichfeldt and Minkowski 

III.l. Introduction. The whole ol the geometry of numbers may 
be said to have sprung from Minkowski’s convex body theorem. In 
its crudest sense this says that if a point set in M-dimensional euclidean 
space is symmetric about the origin (i.e. contains — x when it contains x) 
and convex [i.e. contains the whole hne-segment 

Xx + [\ — X)y 

when it contains x and y] and has volume V > 2”, then it contains an 
integral point u other than the origin. In this way we have a link 
between the “geometrical” properties of a set — convexity, symmetry 
and volume — and an "arithmetical” property, namely the existence 
of an integral point in SX’. Another form of the same theorem, which 
is more general only in appearance, states that if A is a lattice of 
determinant if (A) and is convex and symmetric about the origin, 
as before, then ^ contains a point of A other than the origin, provided 
that the volume F of is greater than 2” if (A). In § 2 we shall prove 
Minkowski’s theorem and some refinements. We shall not follow 
Minkowski’s own proof but deduce his theorem from one of Blich- 
feldt, which has important applications of its own and which is 
intuitively practically obvious: if a point set has volume strictly 
greater than d (A) then it contains two distinct points and x^ whose 
difference x^—x^ belongs to A. 

The theorems of Blichfeldt and Minkowski may be regarded as 
statements about the characteristic functions of a set 6X, that is the 
function x (®) which is 1 lix^SX but otherwise 0. There are generaUsa- 
tions of the theorems of Blichfeldt and Minkowski to non-negative 
functions ip{x) due to Siegel and Rado. These we present in §3. 
We do not in fact use these theorems later. 

In § 4 we use Minkowski’s theorem to obtain a characterisation of 
a lattice which is independent of the notion of a basis: a lattice is any 
set of points A in M-dimensional space which (i) contains n linearly 
independent vectors, (ii) is a group under addition, i.e. if x and y 
are in A so are a; ± y, and (iii) has only the origin in some sphere 
Xi-\ where j; > 0. 

In § 5 we introduce the notion of the lattice constant A {SX) of a 
set 6X. This is a number with the property that every lattice A with 
d{/\)<A {£X) has a point other than o in while there are lattices 
whose determinant d (A) is arbitrarily near to A {£X) with no other 
point than o in . 5 ^. In § 6 we discuss at length a method due to Mordell 
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which uses Minkowski’s convex body theorem to evaluate or estimate 
A [ 6 ^) for sets which may or may not be convex. The idea is, roughly 
speaking, to show that if a lattice A of given determinant d{A) =Aq 
has no points except o in then at least A must have points in various 
sets abutting on 6 ^. Since these points belong to A, so do linear com- 
binations of them. These combinations must be either o or lie outside 
In this way more and more information about these points of A near 
y is obtained, until there is a contradiction ; the contradiction showing 
that every lattice A with determinant d{A)=A^ has a point in y. 
This method is particularly effective in 2 dimensions, since the relation- 
ship of the various points to each other then springs to the eye. Con- 
sequently in § 6.2 we give a series of simple lemmas about 2-dimensional 
lattices which are non-the-less useful tools. Mordell’s method is 
apphed, amongst other things, to finding A {S^) when y is the region 

\Xl+Xl\<\. (1) 

This is equivalent to finding the lower bound of the values taken by 
a binary cubic form with negative discriminant. This question was 
discussed but not answered in Chapter II. The proof given here is a 
conflation of several given by Mordell. It uses essentially the algebraic 
background. We remark in passing that Mordell (1946 a) has shown 
that the result obtained generalizes to all regions which look sufficiently 
like (1). Similarly, Bambah (1951a) has proved a result to show that 
all sets which look sufficiently like 

|XiX,(Z, + Z,)l<l (2) 

do, in fact behave hke (2). The set (2) corresponds to binary cubic 
forms with positive discriminant in the same way as (1) does to those 
with negative discriminant. For example Bambah’s result applies to 
regions with hexagonal symmetry and six asymptotes at angles nj}, 
the set of points between two asymptotes which do not belong to 6 ^ 
being convex. Compare Chapter X, §3.3. 

Finally, in § 7 we use Minkowski’s theorem to obtain some results 
about the representations of numbers by quadratic forms; for example 
that every prime p= 4 fn + ^ can be expressed as the sum of the squares 
of two integers; p=ul-\-ul. This is all rather aside from the main 
theme of the book but the proofs are so elementary and so striking 
that they deserve to be better known. 

III. 1.2. It is convenient to introduce here some important defini- 
tions and notions. 

The length of a vector ® = (%,..., x„), namely 

(.V? -) h xy 
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will, as usual, be denoted by 

|*|. 

It satisfies the "triangle inequality” 

\x+y \ ^ |x| +|y| 

for all vectors x and y. The length of a vector is not an invariant under 
all unimodular transformations, unhke most of the concepts we work 
with, but we shall be concerned only with the topology induced by the 
metric |£c| and not the metric itself. Let 

y. = 2a.-,x:,- (1) 

be a real transformation of determinant 

det(a,,) =f=0. (2) 

Clearly 




Since det (a^y) =|= 0, we may solve (1) for the Xj and obtain, say, 

(3) 

i 

Then similarly 
where 

B =max|/3,y|. 

Hence there exist constants Cj, Cj independent of x and y such that* 

0< ^ Ca< (4) 

\y\ 

We shall often make use of the following consequences without 
exphcit reference. 

Lemma 1. Lei A be a lattice in n-dimensional space. Then there exist 
constants rji , depending only on A with the following properties 

(i) If ugA, »eA and \ u — v\<rji, then u and v are identical: 

(ii) The number N{R) of points of A in a sphere |a;| <f? is at most 

Both of these statements are trivially true for the lattice Aq of 
points with integer coordinates. But now (cf. § 3 of Chapter I) if A 

^ This is a particular case of a result to be proved later (Chapter IV, Lemma 2 
Corollary). 
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is any lattice with basis 

A,;) 

then the points of A are just the points ( 3 ) with The truth of 

(i), (ii) in general now follows at once from ( 4 ) and the truth of (i), (ii) 
for Aq • 

III. 1.3. We say that a sequence of vectors (r — i,2,...) con- 
verges to the vector x' as limit if 

lim I ac, — I =0 

in the usual sense. Clearly a necessary and sufficient condition for this 
is that the co-ordinates of should converge to the corresponding 
co-ordinates of x', since clearly 

max|;rj| ^ |x| ^«lmax|;r,| 

for any vector x = {x-^, x„). An immediate consequence of Lemma 1 

ci) is that a sequence of vectors u, of a lattice A can converge only if ti, 
is the same for all sufficiently large r, say 

“r = 

A set y of points is said to be compact if every sequence of points 
x^(:6^ contains a subsequence y^=x,^ which converges to 

a limit in ^ : 

lim J/j = y' € . 

s— »-oo 

A classical theorem of Weierstrass states that a set 6^ in «-dimensional 
euclidean space is compact if and only if it is both bounded (i.e. con- 
tained in a sphere |a;|<A for some sufficiently large R) and closed 
(i.e. if *,6.9^ (l^r< 00 ) and ae' = hma:;, exists, then x'eS^}. 

For the sake of completeness we give a proof of We:erstrass’s theorem. 
Suppose first that 5*’ is a compact set. If y were unbounded, we could find a 
sequence of points such that |a;^|->oo, and then it clearly cannot contain 

a convergent subsequence. Hence a compact set y is bounded. If S/’ were not 
closed, we could find a sequence of points y such that lim = x' is not in y. 
Clearly every subsequence of the original sequence tends to x'. Hence a compact 
set y is closed. Now let y be a set which is both bounded and closed. We shall 
show that y is compact. Let (l^r< 00 ) be a sequence of points of y. We 
may suppose that originally all the x, are contained in a «-dimensional cube 'ifp 
of side 2R for some R. This cube may be dissected into 2“ cubes of side R by 
taking planes through the centre of parallel to the faces. For definiteness we 
take the cubes of side to be closed, that is to include their boundary points. 
At least one of the cubes of side R must contain for infinitely many r. Let 
be one of these. On repeating the original process with 'ifj instead of we obtain 
a cube of side jR contained in which contains x^ for infinitely many r. 
And so on. In this way we obtain a sequence of cubes '<fj (0^s< oo) of side 

5* 
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such that '^s+i contained in Each contains for infinitely 
many r. The cubes define a point x' which is contained in all of them. We may 
now find a subsequence tending to x' as follows : x,^ is any point of the original 
sequence in if rj, have already been fixed with 

. . . <Fj, 

then is any one of the infinitely many indices r>r^ such that x, is in 
Finally, since 

x' = lim X, , 
s-^oo '• 

the point x' is in if , since S/’ is assumed closed. 

There is a form of Weierstrass’ Theorem which is apparently 
more general. Let 

1^r<oo) 

be a sequence of sets A, of m points ac,,, in a compact set Then there 
is a increasing sequence - of integers such that all the limits 

lima;* 

S — ►00 

exist and are in For if 

a;* r (^1 A r » • • * » k f ) » 

the sets A, may be represent by points with coordinates 

\^k^m) in »w-dimensional space. Clearly the set of 
points X = (xy*) with 

is bounded and closed if is. Hence the points X^ have a convergent 
subsequence X,^. Then the clearly do what is required. 

[Alternatively one could make use of the so-called diagonal process. First 
pick out a subsequence 

K, = (»flj »ms) 

of the A, such that ^ is convergent. Then pick out a subsequence C^= (*u, .... 
of the Bj such that is convergent. The sequence is also convergent, being a 
subsequence of the convergent sequence And so on. After m repetitions of 
the process one obtains the required subsequence.] 

III. 1.4. By volume we shall mean in this book Lebesgue measure 
unless the contrary is stated. We shall however have no need of any 
of the more recondite properties of measure ; the sets we shall be mainly 
concerned with have a volume by any definition, for example the 
interiors of cubes or ellipsoids. 

III. 2. BlichFELDT’s and Minkowski’s theorems. We use the no- 
tation and results of Chapter I. To Blichfeldt is due the realization 
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that the following almost intuitive result forms a basis for a great 
portion of the geometry of numbers [Blichfeldt (1914 a)]. 

Theorem I. Let m he a ‘positive integer, A a lattice with determinant 
<f(A), and i/’ a point-set of volume V[^, possibly V[S^ = oo. Suppose 
that either 

V{y)>md{A), (1) 

or 

V{y)=md{A) (2) 

and y is compact. Then there exist w-fl distinct points x^, 
of y such that the differences x^ — Xj are all in A. 

Let bj, . . . , 5„ be any basis of A and let ^ be the generalized paral- 
lelepiped of points 



yi^id (0^y,<1, i^j£n). 

Then ^ has volume 

F(^)=ldet(5i,...,5J|=rf(A). (3) 

Every point x in space may be put in the shape 
o:=u-f-», ugA, 

and this expression is unique, since the points of A are just the 

yi&iH hy„b„. where yi, .... y„ are integers. 

This parallelepiped ^ will play an important part later (Chapter VII), 
where it will be called a fundamental parallelepiped for A. 

For each u g A let ^{u) be the set of points v such that 

Clearly the corresponding volumes V{^(u)} satisfy 

Zv{^{u)} = v(y). (4) 

u 

Suppose now that the first alternative holds, namely V{£^)>mdiA), 
so that (4) imphes 

ZV{^{u)}>md{A) =mV{0>). 

U 

Since the 0l{u) are all contained in there must be at least one 
point which belongs to at least w + l of the ^(ul, say 

»o&^(«,) (1^/^w-pi), 

where the Uj are distinct. Then the points 

®. = »o + W/ 
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are in ^ by the definition of ^(u), and 



®, — ajy = u, — Uy 



eA 
+ o 



(» + /) • 



This proves the theorem for the first alternative. 

Suppose now that the second alternative holds. Let e, (l^r< oo) 
be a sequence of positive numbers and 



hm £, = 0. 



For each r, the set of points (l + £,)ac, clearly has 

volume 

(1 + £,)” V(S^) > V{y) = md(A) . 

Hence, by what we have already proved, there exist points 



such that 



X,;S(i+£,)y (1^/^W+l) 






(say) = Xi, - X,; 



pA 



(* + /)• 



( 5 ) 



By extracting suitable subsequences of the original sequences, and then 
calling them e^.Xj, again to avoid introducing new notation, we may 
suppose, without loss of generality, that 

lim®,-, = (1^/^w+l) 

r — *■ oo ' 

all exist. Since 6^ is now assumed to be compact, the xj are in 
Then, by (5), 

x'i — Xj = lim uJi, j ) . 

' r~*‘Co 

But now the are in A. Hence (cf. § 1.3) is independent 

of r from some stage onwards : 



u,(i,j) =u'[i.j) (r^Fo)- 

Hence 



/ / 



Xi — Xj 






eA 

=4=0 



(4=t=/), 



as required. 

For later reference (Chapter VII) we note that in the proof for the 
first alternative we have implicitly proved the following; 

Corollary. Let Sf he any set of points and. let ,5^ be the set of points 
V of the fundamental parallelopiped which can be put in the shape 



v=x — u, x^y, ueA. 




Blichfeldt’s and Minkowski’s theorems 



71 



Then 

V{ST^)^V[^). 

If no difference —x^ between distinct points of 6^ belongs to A then 
F(^i) = V{9’). 

The first paragraph is clear. The second follows since then no two 
^(u) overlap. 

III. 2.2. From Theorem I we deduce almost at once the following 
theorem which is due, at least^ for m = t to Minkowski (“Minkowski’s 
convex body theorem’’). 

Theorem II. Let ST be a point set of volume V{ST) (possibly infinite) 
which is symmetric^ about the origin and convex^. Let m be an integer 
and let A be a lattice of determinant d[A). Suppose that either 

V{y)>m2’'d(A), 

or 

V{^) = m 2"d{A) 



and ST is compact. Then S/’ contains at least m pairs of points ±Uy 
which are distinct from each other and from o. 

Again we note that the possibility of infinite volume is not excluded. 
Theorem I apphes to the set ^ST oi points ^x, xeST which has 
volume 2^’'V{ST). Hence there exist w + 1 distinct points 



such that 



^Xji^ST (l^/^w + 1), 






iaj. 



€A 

4=0 




We introduce an ordering of the real vectors and write 



Xi>X2 

if the first non-zero component of x^—x^ is positive. We may suppose 
without loss of generality that 



Put 

Then clearly 



aji>a!2>.>x„+i. 

O, ±Ui,..., ±u„ 



1 The general case is apparently due to van der Corput (1936a). 

2 For the definition of these terms see § 1.1. 
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are all distinct. But since x„+je<S^ and y is symmetric. 

Hence 

«y = -2-®y + i(-»m+i)e^ 

by the convexity of y. This proves the theorem. 

For later use we note the 

Corollary. Let y he symmetric about the origin and convex. 
A necessary and sufficient condition that y contain a point of A other 
than o is that there exist two distinct points ^Xi, whose differ- 
ence |Xj is in y. 

If y contains the point 06 A then \y contains the two points ^o 
and —\a whose difference is o; which proves part of the corollary. 
Conversely, as in the proof of the theorem, if ^x^, \x^ are given, then 
|aci— is in y. 

Theorem II is the best possible of its kind for any m. For example 
the convex symmetric set 

|xj|<w, \xj\<i 

has volume m2” but contains only m—i pairs of points of the lattice 
of integral points other than o namely 

±(m, 0, ...,0) {i^u^m — i). 

We shall return in Chapter IX to the general problem of finding 
convex symmetric sets of volume 2”d(A) which do not contain any 
lattice points other than the origin. 

III. 2. 3. Important examples of a convex symmetric point set are 
those sets y defined by a set of inequalities of the type 

or ^ c, (1^/^L), 

where the Ujy are real or complex numbers. Such a set is clearly sym- 
metric. It is also convex, since if x, y are in y and 

z = ?.x + {i — ^.}y (O^A^l), 

then clearly 

IS ^ ^ |S + (f - |S ^ max j|2 a,,- x,| , | Z a,^ y,|l . 

For sets y of this kind one can relax the condition of compactness 
in Theorem II somewhat. We enunciate the theorem for the most 
important case when the are aU real. It will be observed that the 
argument might be used for a wide class of convex sets y. 
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Theorem III. Let A be an n-dimensional lattice of determinant d(A) 
and let a,, (i^i,j^n) be real numbers. Suppose thaA Cj >0 (1 ^/^m) 
are numbers such that 

Ci...c„^|det(ai,)|i(A). (1) 

Then there is a point ueA other than o satisfying 



ISaiyMyl^Ci 

|2a.fM;| <c.- 



( 2 ) 



Suppose, first, that 



det(a,-,) 4=0. 



Then (cf. Chapter I, §3) the points X=^[X^, defined by 



i 

form a lattice M of determinant 



<f(M)=|det(a,,.)K(A). (3) 

The inequahties (2) become 

j Xi\ < c, (2 ^ t ^ n) . J 

These define a set in the space of X of volume 2"ci ... c„. Hence 
if there is strict inequality in (1) the theorem follows from the first 
alternative in Theorem II. Let now e be any number in 

0<e<l. 

Even if there is equality in (1), there is certainly a point other 

than o, with co-ordinates ...,X„^), such that 

l-^i£| = Ci-t-e<Ci + 'l 

|Z,-,|<Ci (2^t^«). 

But now there are only a finite number of possibilities for X^, by 
Lemma i (ii). Since s is arbitrarily small, one of those possibilities 
must therefore satisfy (4). This proves the theorem unless det(a,^) =0. 
But then it is readily verified that (2) defines a region of infinite volume, 
and so Theorem II certainly apphes. 

III. 3. Generalisations to non-negative functions^. The results of 
§ 2 may to some extent be generalised to non-negative functions f (x) 






^ Cj> 0 follows from ( 1 ) except when det (a,y) = 0. But we do not exclude this . 
^ The results of § 3 will not be used later. 
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of a vector variable x. We suppose that ip{x) is integrable and write 

^ f f(x)dx, (1) 

— OO <:?,•< OO 

where 

dx =dx^...dx„. 

This notation is justified, since if y> is the characteristic function of a 
set y”, that is, 

, ^ h a xey ] 

|o otherwise,! ” 

then V{f) is just the volume V{£^) of y. 

We now have the following simple analogue of Blichfeldt’s 

Theorem I : 

Theorem IV. Let ^[x) he anon-negative integrable function and let 
A be a lattice of determinant d[A). Then there is certainly a point «(, 
such that 

d{A)Y,f{v^ + u)'^V{xp). ( 3 ) 

Before proving Theorem IV we note that it certainly implies the first 
alternative form of Theorem I. For if yi is the characteristic function 
of a set y and V{\p) = F(y)>w<f(A) for some integer m, then (3) gives 

+ tt) > w , 

U 

and so 

Zv(»o + m) ^ w + 1 , 

U 

since now y) {x) is given by (2) . But this means that there are m-\-i 
distinct vectors Uj such that , and this is just the conclusion 

of Theorem I. 

The proof of Theorem IV follows that of Theorem I. Let b^, 
be a base of A, and y, as before, the set of 



yibi + --- +y„b„ (0^yy<t); 

so that every x is uniquely of the shape 

x = v-\-u, »ey, weA. 



Then 

F(yi) = / yi(a:) dx 

= Z jf{u + v)dv 
mEA 



= /|Zv(« + »)| 

ImEA J 



dv. 



Since y has volume F(y)=if(A), the theorem now follows at once. 
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II. 3.2. Siegel (1935a) has given a stronger form of Theorem IV which has, 
however, remained rather sterile of applications. For notational simplicity we 
enunciate it only for the lattice Aq of integral vectors. The function 

(p(v) = '^y>{v + u) (1) 

11 6 A, 

is periodic by definition. Its Fourier coefficients c (p) = c(p^, .... ^„), where p€\, 
are given by 

r (p) = / (p(ti) (ft), (2) 

where (pti) denotes the scalar product 

t^i + ■ • ■ + t/„ . 

On substituting (1) in (2), we have 

<^(P)= / dx, ( 3 ) 

— 00 < Xj <00 

since pu is an integer when p6A„, uSAj. In particular, 

J(p{v)dv==c{o) = V(y!). (4) 

But now, by a fundamental theorem in the theory of Fourier series, 

J(p^{v)dv = 'Z,\c{p)\‘. (5) 

^ peA. 

Since q>{u)^0 for all v, there must be some v,, such that 

f <p^(v)dv ><p{Vq) f <p{v)dv = (p(vq) V{y)). (6) 

On substituting the definition of 95 (Wj) and the values (3), (4), (5) in (6) we have 

Zy>{Vo + ^•)=viVo)^V{V>) + {V(y>)^^Z\h(x)e-^^'^P^^dx\\ (7) 

m£A„ peA„ 

P4=o 

This is Siegel's inequality. 

When a general lattice A is substituted for Aq on the left-hand side of {7) then 
A* must be read for Aq on the right-hand side, where A* is the polar lattice of A 
defined in Chapter I, § 5. 

III. 3.3. We now give Rado’s generalisation of Minkowski’s convex 
body theorem II. [Rado (1946a), see also Cassels (1947a).] Rado 
considered very generally a homogeneous linear mapping X of M-dimen- 
sional vector space into itself given by 

( 1 ) 

when X="kx. We write det(X) =det(A^y). 

Theorem V. Let y>{x) be a non-negative function of the vector x in 
n-dimensional space which vanishes outside a hounded set, and suppose that 

Vi(Xa? - Xy) ^ min {f{x),y){y)} 



( 2 ) 
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for all real vectors x and y. Then 

( 3 ) 

ugA ^ ' 

for any lattice A, where 

^iw)= / 

— OO <Xj<00 

(l^lSn) 

Before proving Theorem V we note that it does in fact imply the 
first alternative part of Theorem II. Let y>{x) be the characteristic 
function of a convex symmetric set so that F(^) = For X we 

merely take X» =|a;, so that det(X) = (^)”. The condition (2) is certainly 
satisfied, since the right-hand side of (2) is 0 unless both x and y are 
in 6T ; and then 

\x-\y = ^x + \{-y) 

is also in y by the convexity and symmetry. On the other hand the 
left-hand side of (3) is ^ -f 1 , where p is the number of distinct pairs 
A in other than o. Hence if V{y>)>m2’'d{A), we have i+p>m, 
that is p'^rn] which is the conclusion of Theorem II. 

To prove Theorem V we need an elementary combinatorial lemma. 
Lemma 2. Given any sequence of distinct vectors 

we can construct another sequence 

satisfying the following three conditions: 

(i) Wo = o. 

(ii) w,=f=±t», if r=t=s, 

(Hi) every w, is the difference between two of the first r-fl elements 
of [z], say 

m,^r). ( 4 ) 

We introduce an ordering of real vectors and write 

a!!i> ®2 

if the first non-zero coordinate of x^—x^ is positive. If Xi^^Xf^ then 
either Xi>X 2 or X 2 >Xi. We construct w^, ... in turn, so that 

tc,> o {r>0). 

The vector Wq is given. Suppose that tCj, M?,-! have already been 
constructed, where r^i. There is a unique permutation Z),^(0^f^r) 
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of the vectors (0^/^r) so that 
The r vectors 

(7=1.2 r) 

are distinct from each other and from o. Hence we may choose as w, 
one of them which is distinct also from the r — 1 vectors iCj, 

Since we cannot have w,= —Wj. Hence the 

do what is required. 

Theorem V, will be an almost immediate consequence of the following 
Lemma. 

Lemma 3. Suppose that (2) holds and that det(X) 4 = 0 , so that a trans- 
formation reciprocal to X exists. Then 

2y;(X-iu-f X^i*)^^(o) -f (5) 

«4€A m6A 

+ « 

for every real vector t. 

For fixed t let z, be the sequence of vectors « of A such that 
y)(X~‘2-f X"^t)>0 arranged so that 

V>(X-i«, + X-i<)^y(X-i», + X-it) (r^s). (6) 

Let w, be the corresponding sequence defined by Lemma 2. We apply 
(2) with 

x = x, = X“i«,_, + X“i< 

where and m, are defined by ( 4 ). Then 

min {rp (ae,) , y) (j/,) } ^ y; (X-i + X'l t) ( 7) 

by (6), and since m,^r. But now, by ( 4 ) again, 

and so, by (2) and (7) 

V (m 5 ,) ^ f (X“i a, + X-i t) . 

Similarly, on interchanging x^ and y, , we obtain 

(X'l 2;, + X-i t ) . 

Hence, since y)^ 0 , we have 

(ll) Syj(Wo) +Z {V'(M’r) +W{-^r)} 

ugA r>0 

^ y> (X-i + ^-^ *) + 2 Z V (X-i «, + X-i f) 

f>0 

= - y; (X-i Z, + -K~H)+ 2 Zv <) . 

u$A 



( 8 ) 
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since every vector Mg A with y(X ^m+X >0 occurs as a But 
now (2) with y=x implies that rp{o)^y){x) for anyjc, and in particular 

+ ( 9 ) 

The truth of the lemma follows now at once from (8) and ( 9 ). 

Finally Theorem V follows from ( 5 ) on integrating with respect to t 
over a fundamental parallelepiped ^ of A defined as in §2.1. The 
left-hand side becomes 

// 2 v(X-' 1 m - bX-if)Ut = / xpCk-H)dt=\det(\)\V(f). 

^ lucA J — oo<tj<oo 

The right-hand side of ( 5 ) is independent of t and so, on integrating 
with respect to t, is merely multiplied by V{ 3 ^) =i(A). This proves the 
theorem. 

Rado (1946a) discusses the homogeneous linear transformations X 
for which there is a function y>{x) which is not identically 0 satisfying 
(2). It turns out that X must satisfy pretty stringent conditions, and 
that taking multiplication by \ for X is in a sense on the borderline of 
what is possible. 

III. 4 . Characterisation of lattices. We are now in a position to 
give a characterisation of lattices in which the notion of a basis does 
not appear. 

Theorem VI. A necessary and sufficient condition that a set of points 
A in n-dimensional euclidean space be a lattice is that it should have the 
following three properties : 

( i) If a and h are in A then a±:b is in A. 

( a ) A contains n linearly independent points Mj , . . . , a„ . 

(in) There exists a constant rj >0 such that o is the only point of A 
in the sphere 

I a; I <77, 

where, as usual, 

|a;| = (^?-b +xl)^. 

By the definition and Lemma 1 every lattice satisfies (i), (ii), (iii). 
It remains to show that any set A satisfying (i), (ii) and (iii) is a lattice. 

We note first that it follows by induction from (i) that if Cj , . . . , c„ 
are any points of A and u^, ...,u„ are integers, then 

MjCiH (u,„e„e/\. 



Secondly, we show that if 
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are « + 1 points of A, then there are integers Uj (1 + 1) not all 0 

such that 



2 UjCj = 0. 



For by Theorem II there certainly exist points (mj, M„_,_j)=t=o of the 
(« + l)-dimensional lattice Aq of integral vectors in the convex sym- 
metric (m + 1) -dimensional set of infinite volume defined by the n 
inequalities 

2 CijuA<rjln 
IS/Sn + l I 
Put 



so that trivially 



d = 2 
ld| <7j. 



Then d=o by property (iii), as was required. 

Now let Mj be the lattice with the basis a^, given by (ii). 

Then is a subset of A. If A coincides with there is nothing to 
prove. If not, there is some vector b in A but not in Mj. But now, 
on applying the result of the previous paragraph to the « + f vectors 
Oj , . . . , a„ and b, there must be integers and v not all 0 sucb 

that 

vb=Miai-l (f) 



Here i^=4=0, since aj,...,a„ are linearly independent. Further, 
i) 4= 1 since b is not in by hypothesis. We may suppose that b is 

chosen so that | in (t) is as small as possible. Let p be a prime divisor 
of V and write 

V =pv^ bj = Vjb. 

Then 

pb^ = u^ai-\ 

where not all of Mj , . . . , m„ are divisible by p since is not in (because 
V was chosen minimal) . Without loss of generality, p does not divide , 
and so 

Ip — WMj = t 



for some integers I and m. Put now 

a'l = la-^ — mb^ | 

a,' = (2 ^ ^ , J 

so that conversely 

Cj = pa[ -f wMjaa -f mu„a'n 

Oj = a' (2^ n) . 

Let Mg be the lattice with basis a-. 



( 2 ) 



( 3 ) 
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Then Mj has index ^ in Mj, so in particular 

(4) 

But now, by (2), a basis of is in A and so Mg is entirely contained 
in A. We may now repeat the argument. If Mg does not coincide with 
A there is a third lattice Mg which is in A and contains Mg as a sublattice. 
And so on. Now, by (4), 



If 






d (M,) < (r?/«)". 



where rj is defined in (iii) of the enunciation of the Theorem, then, by 
Theorem II, M, would contain a point d =)= o with 

contrary to hypothesis. Hence the chain of lattices Mj , . . . , M,., . . . must 
have a last, M^j; and M^ then coincides with A. 

III. 5. Lattice Constants. We must now introduce a number of new 
definitions relating to lattices and points sets. The new concepts will 
be subjected to a searching analysis in Chapters IV and V ; here we just 
prove enough to show their use and to enable apphcations of Min- 
kowski’s theorem to be made. 

Let be any point set. If a lattice A has no points in other 

than o (if o is in then we say that A is admissible for 6^ or 

.5^-admissible. We call the infimum (greatest lower bound) of d{A) for 
all A-admissible lattices the lattice constant of 6^ and write 



= inf (A) (A is y’-admissible). 

If there are no y’-admissible lattices then we say that y is of infinite 
type, and write A {^) = oo ; otherwise y’ is of finite type and 
0^/d(y’)<oo. An y’-admissible lattice A with d(A)=A(S^) is said 
to be critical. Critical lattices play a very prominent role in Chapter V. 
Of course in general there is no reason why a general set y’ should have 
critical lattices at all. 

Our definitions do not quite correspond with those of Mahler 
(1946d, e). He is usually concerned with closed sets y and says that 
A is y’-admissible if no interior point of y except o belongs to A, that 
is if A is admissible in our sense for the set of interior points of y. 
Our usage is a compromise between Mahler’s and that proposed by 
Rogers (1952 a). 

III. 5. 2. The definition of A (y) may be stood on its head: A (y) is the 
greatest number A such that every lattice A with d(A)<A has a point 
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other than o in The discussion of § 4 of Chapter I shows that many 
of the results of Chapter II may be interpreted as giving the value 
of A ( 6 ^) for certain regions Sf . Take for example the statement that 
if /(x) = /i 1^1 + 2/12X1X2 + /s 2^2 is a definite quadratic form and 
£) = /11/2J — /^2, then there are integers u = (mj , M2) =1= ® such that 
/(«) ^ (4 D/ 3 ) i , with equality only for forms equivalent to /j 1 (xf + Xj X2 + x*) 
(Theorem II of Chapter II). This is equivalent to the statement that 
the 2 -dime„sio„al set 

has lattice constant A(^) =(f)i and that the critical lattices are pre- 
cisely those with a base bi=(6n, b^i), b^=(b^2’ ^22) such that 

(61 1 Xi + 61 2 X2) ^ + (f>2 I ^1 + *2 2 ^2) ^ = ^1 + *2 + ^2 (2) 

identically. The reader will have no difficulty in making the translation 

for himself (cf. Lemma 4 of Chapter II. We can also make a geometrical 
interpretation of (2l. Put 

ill = cos 1?, ftji^sind, 

612 = cosy), i>22 — siny). 

Then (2) is true provided that 

cos & cos y) -f sin i 9 sin y) = ^, 
that is provided that . , . 

Hence the critical lattice has as basis two points at angular distance 
jr/3 on X?-fX| = l. A further point on Xl-{-Xl = \ is bj— 62, as is 
clear from (2). It is readily verified that the six points ±61, ±^>2> 
±(*>1-5.2) are the vertices of a regular hexagon inscribed in X\ + Xl = 1 . 

III. 5 . 3 . In this and in the next section we shall use Minkowski’s 
convex body Theorem II to evaluate or estimate A ( 6 ^) for various 
sets 6 ^. Theorem II is directly apphcable when 6 ^ is symmetric and 
convex, since it asserts that then 

A{y)^ 2 -’'V(^). (1) 

This applies for example to the circular disc^: Xi±A|<l and gives 
Zl (^)^ 71/4= 0.785 ..., which may be compared with the exact value 
( 1)1 = 0.866 ... obtained above. 

Even if our region 6 ^ is not convex or symmetric, we may obtain 
estimates for A (^^) below if a convex symmetric body ^ is inscribable 
in it. Clearly zl {S^)'^A{^) if is a subset of y, since every y-admis- 
sible lattice is automatically y-admissible. Hence 

zi(y)±zi(y)^2-"F(y). 

Cassels, Geometry of Numbers 
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Consider for example the region 

\X,...X„\<i. 

This contains the convex symmetric region 

|Zi| +■•• +|Z„|<n 

by the inequality of the arithmetic and geometric means. Now ^ is 
convex and symmetric, since it is defined by homogeneous linear in- 
equalities, and its volume is 

Hence 

A{^)^n’'ln\. 

We shall later obtain a rather better estimate than this (Chapter IX, 
§ 8). We note the translation into the theory of forms: Let 

Lj{x) = 2 

be real linear forms in the n variables x = {Xi, , x„) with det(c,,) =(=0. 
Then there exists an integral u4=o such that 

77i,(M)|^~|<let(c.,)|. 

I I 

Minkowski’s convex body theorem also permits the evaluation of 
A [S^) for sets which are not symmetric in o. We reproduce here, 
with his kind permission. Professor Mahler’s elegant treatment of the 
simplex, hitherto unpublished^. Let be an open simplex in w-di- 
mensional space containing o. If the faces of 6^ are given by the equa- 
tions 

Lj{x)=i 

where the Lj{x) are linear forms, then is the set of points satisfying 
Lj{x)<\ (0^/^w). 

There is one non-trivial relation between the linear forms, say 

2 =0 

0£/gn 

identically in x, where the a,- are real numbers, and without loss of 
generality 

ao> 0. 

* It is given, however, in his mimeographed lecture course, Boulder (Colorado), 
U.S.A., 1950, together with other interesting results about non-symmetric sets. 
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If, say, ai ^ 0, then would contain the infinite ray of points x satisfying 
Io(®)^0 L^{x)=0 O' 4= 0,1); 

which is impossible, since 4'’ is a simplex. Hence 



oij >0 (0 ^ ^ m ) . 

We may suppose without loss of generality that 



and then 


ao = 1 = min , 


(2) 


where 


io(*) = “ 2 


(3) 


We show that 


a, ^ 1 . 


(4) 




A{^) =2~’'V{^), 


(5) 



where V{^) is the volume of the parallelopiped 

|Iy(a!)|<1 {\^j-£n). 

In the first place, if A is a lattice with tf(A)<2~"F('^), then there is 
a point a =1= o of A in By taking — a instead of a if necessary, we may 
suppose that 

Io(a)^0, 

and then 

Lj [a) <1 (0 ^ ^ m) ; 

so a is in y. Hence A{£^) ^ On the other hand, we shall show 

that the lattice M of points a such that 

Lj{a) =Uj = integer (1 ^ / ^ w) 
is admissible for 6^. 

If a is in y, we must have Uj^O (1^/^m), and min«^£= — 1 if 
a=\=o. But then, by (4), we should have 

Lo{a) = 

and so a is not in Hence o is the only point of M in Since 
(M) = 2"" F('^), this completes the proof of (5). We note that 
2^"F('^) =|cfo|“\ where is the determinant of then forms I-i, 

By (3) and (4), d„ is the least in absolute value of tire determinants of 
selections of n out of the m + 1 forms Z.^, 

Estimates of A [£^) for non-convex sets SA may be obtained from 
Theorem I instead of Theorem II. Let ^ be any set such that all the 

6 * 
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differences 

— aja€^ (6) 

lie in Then 

since by Theorem I if i(A)<F(^) there exist two points 
such that »i— x^gA; and by hypothesis Xj— Xjg^^. Of course if ^ 
is a convex symmetric set inscribed in y we could take ^ but 

then we get just the same estimate J (y)^2~"F(y) as by the use of 
Theorem II. However Mordell and Mullender found suitable sets 
M in the case they were treating such that F(^) was greater than 
2"”F(y) for any convex symmetric inscribed y The increases are 
usually comparatively small and obtained at the expense of some 
complication. We refer the reader to Mullender (1948a) and the 
hterature quoted there for further information. 

In Chapter VI are obtained upper estimates for zl (y) in terms of 
F(y) which are valid for all sets (Minkowski-Hlawka Theorem and 
related topics). 

III. 6. A method of MORDELL. In this section we develop a method 
of Mordell for finding J (y) precisely for point sets y which may or 
may not be convex. The method applies primarily to star bodies. 
This class of sets is defined by the properties that the origin is an inner 
point and any radius vector meets the boundary either not at all or in 
precisely one point: in other words, if x is any vector other than o, then 
either fxgy for all or there exists a (q such that tx is an inner 
point of y, a boundary point of y or not in y according as i<io, 
t = or t>tg. We now have the rather trivial 

Lemma 4. Lei 5^ be a star body and suppose that a constant exists 
with the following two properties. 

(i) every lattice A with d[/\)=Ag has a point other than o in or on 
the boundary of y. 

(ii) there exist lattices \ with di/K^^A^ having no points other 
than o in the interior of y. 

Then A (y) =Aq. If further, y is open‘d, then the critical lattices are 
fust the . 

For suppose, if possible that M is an y-admissible lattice with 
if(M)<z]o. Let y>1 be defined by y"if(M) =Aq. Then the lattice yM 
of points yx, X6 M has clearly no points in or on the boundary of y, 
contrary to (i). Hence A(S^)'^Ag. On the other hand (1+fi)A^ has 
no points in y for any e>0, where A^ is one of the lattices given in (ii). 

* i.e. does not contain any of its boundary points. Minkowski and following 
him M.'^hler define a star body to be closed. We depart from their nomenclature. 
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Hence /I {9") ^ (1 + e)"Zlo> so A [9] =A^. The truth of the last sentence 
of the lemma is now obvious. 

When the description of star-bodies by distance-functions is intro- 
duced in the next chapter, Lemma 4 will fall into place as part of a 
wider theory. 

Mordell’s method of finding A (9) for a given star-body 9 may 
now be described. First one must make an intelligent guess Aq at A {9) : 
in particular so that (ii) of Lemma 4 is true. If A^ has been correctly 
chosen, then it may be possible to verify (i) and to find all the in 
(ii) by the following general procedure, of which the details naturally 
vary widely from case to case. We suppose for simplicity that 9 is 
open. Let M be any ,9^-admissible lattice with ^f(M)=Zlo. Then if 
9j(\^j-^r) is any collection of closed convex symmetric sets each of 



there must be points p,=i=o of M in 9^ for Since M is 9- 

admissible, the p, must lie in , the set of points of 9^ which are not 
in 9. We may now use the hypothesis that the Pj are in a lattice M 
of determinant A^, to obtain further points of M. Since these cannot 
lie in 9, this gives further information about the p,. In the end it 
may be possible to show that M is one of a set of lattices A^, all of which 
have points on the boundary of 9. Lemma 4 shows that A (9) =Aa- 
Of course the power of the method depends on a suitable choice of the 9^. 

Mordell’s method is at its best in deaUng with 2-dimensional 
regions, since for these it is easier to grasp the geometry of the figure. 
Before giving some concrete examples we must therefore study the 
geometry of a 2-dimensional lattice more closely. 

III. 6. 2. Throughout § 6.2 we denote by A a 2-dimensional lattice. 
We regard vectors as coordinates of points on a 2-dimensional euclidean 
plane, and use the normal geometric language to discuss their relations. 
By distance we mean the usual euclidean distance. For later reference 
we formulate our conclusions as lemmas. 

We say that a point u of a (not necessarily 2-dimensional) lattice 
is primitive if it is not of the shape u = kuy, where Uj€A and k> \ is 
an integer. 

Lemma 5- Let u be a primitive point of the 2-dimensional lattice A. 
Then the points of A lie on lines it, (r=0, ±L ■•■) which are parallel 
to ou and at a perpendicular distance 



rd{A)l\u\ 

from if'-. Each line tt,. contains infinitely many points of A and these 
are spaced at a distance |u|. 

^ As before |u| = that is the distance from o to u. 
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This is just a re-statement in geometrical language of what is known 
already. Since u is primitive, there is a point v which with u forms a 
basis for A (Chapter I, Theorem I, Corollary 3). Hence 

det (u, ») = ± (A) , 

that is the perpendicular distance from v on the line through o and u 
is d{/\)l\u\. But now A is just the set of points 

rw-|-sM (r, s integers). 

Clearly the points with r fixed but s varying he on a line tt, with the 
required properties. 

Lemma 6 . Let u, v be points of the 2-dimensional lattice A such that 
o, u, V are not collinear. Then a necessary and sufficient condition that 
u,v be a basis for A is that the closed^ triangle ouv should contain no 
points of A other than the vertices. 

The condition is clearly necessary, by Lemma 5, so we must prove 
it sufficient. If there are no points of A in the triangle ouv other than 
the vertices, then the same must be true of the triangles with vertices 



and 



— M, O, V —U 

— V, u — v, o, 



(1) 

( 2 ) 



since, for example if as is a point of A in (1), then as -fu is a point of A 
in triangle ouv. Similarly there can be no points of A in the images 
of our first three triangles in the origin, since — a: is in A if as is. Hence 
there is no point of A in the hexagon with vertices ±u, ir, 
m) except o and the vertices. By Theorem II 



But 



d(A)^lV{je) =i\det{u.v)\. 



|det(u, »)| —Id{A), 



where the integer I is the index of the points m, r in A (Chapter I, § 2.2) ; 
and so / = 1 , as required. 

The analogue of Lemma 6 does not hold in space of dimension > 2. 

Lemma 7. Let 3. be an open parallelogram with o as centre of area 
4d{A), which contains no other point of A than o. Then A has a basis 
consisting of the mid-point of one of the sides of 3 and a point on one of 
the other pair of parallel sides. 



^ i.e. the sides are counted as belonging to the triangle. 
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After a suitable transformation of coordinates, we may suppose 
that 2 is the parallelogram 

J; 1a:i|<i, \X^\<\ 

and that d[l\)=\. By Theorem III there is certainly a point of A 
other than o in |X 2 |<t, and so A must contain a point 

U = (1, Mg) 1 ^ 2 ! < I • 

Similarly, A must contain a point 

17 = ( 1 ^ 1 , I) 1 ^ 1 ! < t . 

But now, since d{t\)=\, the index of {u,v) in A is 
I = I det(u, «)| = 1 — u^v^. 

But I is an integer and |« 2 ^il<'l- Hence 7 = 1 and either M 2 = 0 or 

Vi = 0. 

Lemma 8. Let t\he a lattice of determinant d (A) which has two points 
other than o in the closed parallelogram with vertices o, a,h, a -\-b and 
volume ( area ) d (A) . T hen either 

(i) the two points are collinear with o, 
or 

( a ) one of the points is a and the other is on the line-segment b, a + b, 
or 

( Hi ) one of the points is b and the other is on a, a -L b. 

For the points p, q, say, are of the type 

p = TtiU -\- n^b , q = x^a K^b 

where 

, 0 ^ ^ 1 (/ = 1 , 2) . 

The index 1 of p, q in A is 

I = \ nxX^ — 7T2>«i1 . 

Hence 711 X 2 — 712^1 = 0 or T: I ; which gives the three alternatives quoted. 

III. 6. 3. We first illustrate Mordell’s method with an example 
where the amount of subsidiary argument required is a minimum. 

Let Jf be the cross-shaped 2-dimensional region defined by 

min{|xi|, |x2|} < 1 , max{|xi|, |x2|} < |. 

We shall show that 

Zl (df) = 2 




88 



Theorems of Blichfeldt and Minkowski 



and that the only critical lattices of c€ are those with the following 
bases: 

Ai basis (1,1) and (1,-1) 

Aa basis (I, -|) and 
Agbasisd,!) and d,f). 

It is readily verified that these lattices are Jf-admissible and have 
determinant 2. Hence by Lemma 4 , it is enough to show that any 

JT-admissible lattice A with 
if (A) = 2 must be one of Aj , 
Aj, A3. 

From now on we suppose 
that 

if (A) = 2: A is JJT-admissible. 

The convex symmetric oc- 
tagon 

I .^2 1 < T > 
l^i| + |:«2| <f 

has area 

y>2^^(A), 

and so contains a point a 4=0 of A. The only points of ^ not in JT 
are the four triangles with |xi|^l, |xa|^l (see Fig. 5 ). Hence, by 
symmetry, we may suppose that A contains a point a = (nj , Hj) with 

fl . 1 = 1 ^ 1 'C ^ , l=I?2^^t 1^1 4 1^2 ^ * (1) 

By Theorem III there is a point b=^o of A in 

ki|<f 1^21^2. 

On taking — b instead of b if necessary and using the fact that b is 
not in JT, we may assume that, the coordinates of b satisfy 

b: |f'i|<l, t^f>2^2. (2) 

Similarly there is a point c of A satisfying 

c: |^Ci^ 2 , lc2|<l. ( 3 ) 

Now we show that a, b is a basis for A. We have 
det (a, b) = aj f>2 — <22 f*i > 1 • I — f • 1 =0 
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and 

so 



det(a, b) < |- 2 + f -1 = f, 



det (a, b) =2 or 4, 



since det (a. 6) is an integral multiple of d(A). Suppose first, if possible, 
that det (a, b) =4, so that the index of a, b in A is 2. For any integer 
A>1 the points k'^a, k^'^-b clearly lie in Jf and so are not in the Sf- 
admissible lattice A: that is a and b are primitive points of A. We 
show now that | (b — a) is in A. Since a is primitive there is a basis 
a,d where, say, det (a, d) (A) = 2. Than b=ua-\-vd for some 
integers u, v, and indeed v — 2 since det (a, b) = 4 = 2 det (a, d). Then 
u is odd since b is primitive, so \ (b — a) is in A as asserted. But 
\ (b — a) is clearly in so we have a contradiction. Hence we can only 



have 








det (a, b) = 2 = d{A). 


(4) 


This gives the estimate 










(5) 



since otherwise we should have the contradiction 



2 — o-ihi ’f + l ' i 

Similarly 

det {a, c) = — 2 = — d (A) 

and 

c > — i 

Since a, b is a basis for A we have 



( 6 ) 

(7) 



c = sa + rb 

for some integers r, s. On substituting this in (6) and using (4) we 
obtain r = — 1 and so 

b + c = sa (8) 



i 2+^2 = ^^ 2 - 

But 

"2 ^ “b Cj ^ 3 > 1 — ^ ^ '2 

by (1), (2), (3); so there are only the two possibilities 

s = 1 or s = 2. 

First case s = 1. From (1), (2), (3) and (9) we have 

^i<0, C2<0. 

From (4j, (5), (6) we have 

det(c, b) = 2 



(9) 



(to) 
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that is 






But Ci^f, 62^1 by (2) and (3) ; and 0>&i^ — I by (5), 

(7) and ( 10 ). Hence (8) can hold only if 

Cj = ^2 = ^ , C2 = ^ , 

which gives the lattice \ . 

Second case s=. 2. By (1), (2), ( 3 ) and (9) we now have 

^>1^0, Cj^O. (11) 

We now consider the lattice-point 

{di,d^) = d = {b - a) = l(b - c) . 

By (2), (3) and (11) we have 

0^2di = bi~ c^^ — 2, 

0 < 2(^2 = ^2 ~ f ^ 2 ^ 2 . 

Since d cannot be in jT we must have di= — i, ii2=-|-l; that is 
Ci = 62=2, 6j = C2 = 0. This gives ai = a2=l. Hence A=Ai- 

In the proof we have made use of the symmetry of the figure. Since 
Aj remains unchanged under transformation of Jf into itself, but A2 
and Ag may be interchanged, we have shown that A is one of Aj , A2 , Ag , 
as required. 

III. 6.4. As a second example of Mordell’s method we take the disc 

xl + xl<i, 

which we have already discussed by other means (§ 5 . 2 ). We take 
/lg=(|)i in Lemma 4. The lattices A^ certainly exist; since they can 
be taken to be the lattices with a basis consisting of two of the vertices 
of an inscribed regular hexagon. We shall show that if d(A)={^)\ 
then A has a point other than o in ^ except when A is a A^. 

There are certainly points of A in the circle 

Xi X2<Z 2 , 



since this has area 2n>2^>2^d(A). Since A is ^-admissible, the point 
must lie in \^xl-\- xl<2. After a suitable rotation of the coordinate 
system we may thus suppose without loss of generality that there is 
a point p = {pi, Pi) in A with 

p2~ ~ ii)^> 



But now, by Theorem III there is a point q=^{q-i, other than o in 
the half-open parallelogram 



J : I Xj -f- 3 ^ ^2 1 = f 



<y' 



3 

4 



X. 
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of area 2]/^=4<^(A) (see Fig. 6). The only portion of J not contained 
in is the curvilinear triangle ^ cut off by the arc of the circle between 
a = (1 , 0) and b = (| , — y|) and the image of ^ in the origin. We may 
suppose without loss of generahty that q is in 

Clearly both p and q are primitive, since, if either were of the shape 
ku with u>zA and integer ^>1, then u would be in S. Further p=\=q, 
since /> 2 = — |/f but l 92 l< | We now apply Lemma 8. From what 




has just been proved, p, q, o cannot be collinear. Hence either p = b 
and q Hes on the Une-segment between a (inclusive) and a-\-b (exclusive) 
or q=a and p hes on the hne-segment between b (inclusive) and a + 5 
(exclusive) ; and by symmetry we may suppose the second holds. Then 
p — q =p — a lies between b—a (inclusive) and b (exclusive) . The only 
one of these points not in ^ is b — a. Hence also p = b. Hence A 
is the lattice generated by a and b. Since we made an arbitrary rotation 
of the coordinate system this completes the proof of the result stated. 

III. 6.5. As a final application of Mordell’s method we prove a 
result about binary cubic forms which fills a gap left in Chapter II, § 5. 
We use the same notation. 

Theorem VII. If /(Xj.Xj) he a binary cubic form of determinant 
D<0, there are integers {u^, m 2 )=|=o such that 

|/(“)1 ^ 

The sign of equality is needed when and only when f is equivalent to a 
multiple of 

xl — Xixl — xl. 



A i 
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This is the most important part of Theorem XI of Chapter II, which 
was left unproved. As already remarked, the form here is transformed 
into the form there by the substitution X2-^ — {x;i-\- X2). We 

already noted that the exceptional form does require the sign of 
equahty since it has Z) = — 23 and represents only integers other than 0. 

We must first express Theorem VII in a geometrical form. We saw 
in Chapter II that any two binary cubics with negative discriminant 
can be transformed into one another. It is convenient to take Xj + X\ 
with discriminant — 27 as standard. Then Theorem VII is equivalent to 

Theorem VII A. Let A be a lattice with 

^(A)=(|-f=Zlo (say) (1) 

in the two-dimensional space of vectors X = (Xi, X^. Then A contains a 
point other than o in 

ST-. |X? + Xi|<1, (2) 

except when A has a basis a — {ai, a^, 6 = (ij, b^ such that identically 
(«i Xj - x'2)» + (aj Xi - Jj X2Y = x\ — x'l x\-x\. (3) 

In stating the equivalence of Theorem VII and Theorem VII A we 
have tacitly applied Lemma 4 to the star body From now on we 
shall be concerned only with Theorem VII A. We use capital letters 
to denote points and coordinates, except that o is still the origin. 
Further, A is a lattice with d(A) given by (1) which has no point other 
than o in the set defined by (2). The set is shown in Fig. 7 . 

First, since Ao<i, there is certainly a point P=|=o of A in the square 

lZj|< 1 , 1X2! < 1 . ( 4 ) 

Since P does not he in either P or — P must he in the first quadrant 
and we may suppose without loss of generahty that 

0^Pi<l, 0 ^P 2 <f- ( 5 ) 

From Fig. 7 (or from elementary algebra) we must have 

Pi + P^^l. (6) 

Suppose, if possible, that there were two such points, P and P'. Then 
their difference P" = P — P' satisfies ( 4 ). Hence on interchanging P 
and P' if need be, we may suppose that P" is in the first quadrant: 
of course it may coincide with P or P'. Hence 



P = P'+P". 
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But now, in the obvious notation, we have jPi+^V = B '+P 2 " ^ 1. 
since neither P' nor P" is in 6^. Hence we should have 

Pi + P2 = (p: + P 2 ) + (Pi + K) ^ 2 , 

in contradiction to (5). To sum up what we have proved so far; there 
is precisely one pair of points ±P^f\ other than o in the square 




I I < 1 , I Xj 1 < 1 . We denote from now on by P the point of A which 
satisfies (5) and (6). 

We now examine more closely the lattices which satisfy (3). We 
must make use of the algebra developed in Chapter II, § 5- Let A^, B^, 
A^, B 2 be any numbers such that identically 

(A^xi- B^X 2 y + {A 2 Xi- B 2 X 2 Y = xl-Xixl-xl=f„{x) (say). (7) 
On equating the hessians of both sides, we obtain 



— 9 (^iB 2 — ^1^2) {^2^1 
5Vo 



B2 X2) 



dx^di,^ 
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The linear factors of both sides must coincide, and so, after inter- 
changing Ai, B I and A^, B^ii need be we have 

^1 Xj B^x^ = A^ jiTj -f- ^ , 

A^Xi- B^x^ = Ai^XiA- ■ 

On comparing the coefficients on both sides of (7), we have 

A\+A\ = \ 

(9-y^)^?+(9+y^^i=o. 

This determines A\, A\ uniquely, and so A^, A^, B^, B^ since they are 
all real. 

Hence there are only two lattices of the type specified in the theorem, 
namely those with base 

A = (A„A^), B=(B„B,). 

and 

A = {A„Ai). B = {B,.B,). 

respectively. 

The approximate values are 

^1=1.014, yl2=- 0.347 

5i=f- 0.017, 52=1.0005. 

All we shall in fact use are the inequalities 

^i>l. ^.<0, 1 ^3^ 

5j<0, B^>i. j 

The signs of A 2 , B^ are easy to establish and, since 
Al + Al = Bl+Bl^i, 



by (7), the rest follows. 

Comparison of discriminants on both sides of (7) gives 
27(^i52-^2 5i)« = 23, 

and so 

A I B^ A 2 5j = it ^o> 

where in fact the -f sign holds, but we do not use this information. 

Let X = xx be the transformation of the plane X — (Xj , into 
the plane a; = (x, , x^ given by 

Xj = A-^^Xi 5j X 2 , 



X2 — .d 2 Xj B2 X2 ■ 
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Then the region t of points t is given by 

\xl — x^xl — xl\ <1. 

Further, A is a lattice of determinant 

(Z(t-iA) =ldet(T)|-itf(A) =1 (9) 

(cf. Chapter I, § 3). 

The region is shown in Fig. 8. The line Xj = 1 touches /q {x) = 
xf — x^xl — xl = f at ^2 = 0 and meets it again at ^ 2 =— t. The line 




%2 = 1 meets xl~ x^xl — xl = — i at Xj = 0, ± 1 . Since no line meets a 
cubic curve in more than three points, it follows readily that the whole 
of the unit square 

•2 : I 1 < t , I ^2 1 < 1 

lies in x^' except for a small region ^ in < 0, Vj < 0 and the image 
— ^ of ^ in the origin. 

Suppose first that (1, 0)ex“^A. Since d(T~^A) =1, there are points 
of x“'A on the line ^2 = 1 spaced unit distance apart, by Lemma 5- 
Since none of these can lie in x”^ the only possibility is that x“^ A = A,, , 
the lattice of points with integral co-ordinates. But then A=xAq, 
which is one of the exceptional lattices permitted by the theorem. 
Similarly, if (0, 1)ex‘^A, then A = xAq. Hence from now on we may 
assume that 

( 1 , 0 )ax-iA, (0, 1)dx-iA. (to) 

By Lemma 7, either there is a point g=bo of A in the square J, 
or (1, 0)eA, or (0, 1)€A. But the second and third alternatives have 
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already been disposed of, and so, since q cannot lie in we may 

suppose that q is in Further, q must be primitive, since if q=kqi, 
with integral ^>1 and gi€T“^A, then q^ would lie in |^ 2 |<i 

and so certainly^ in contrary to the hypothesis that t'^A is 

,9^-admissible. Hence q is unique by Lemma 8. 

We require another point of t'^A. The tangent to /o(a;)=1 at 
(0, —1) is 



Xi 3 -*2 — 3 • 



This meets /<, (*) 



: \ again at 



-28 



25 



j . Hence all of the parallelogram 



J': 



<1, |;ri + 3;r2|^3 



lies in except for the points ±(0, 1) and a region y in Xj<0, 

Xi>0 and its image in the origin. But now, by Theorem III, there is 
a point p of t~^A in J' and, since t"^A is t“^ ^-admissible, we may sup- 
pose by (10) that p is in The point p is primitive since if p—kp^ 
with integral k>\, then p^ lies in |xi| |x 2 |^f, so p-^fT~^S^. 

An application of Lemma 8 shows that p is unique. 

We note that the point 2q—p clearly lies in %i<0, X 2 > — 1. 
Since the point q is the only point of t'^A in it follows that 2q — p 
must lie in the region /^ («) ^ — 1 . 

The next stage is to show that p and q form a basis for A. We have 



and 



Hence 



0<Pi<\, 

-1<9i<0, 

det(p, q)=Piqi-qip 2 



%<Pi<-\, 

- 1 < 92 < 0 . 



( 11 ) 

( 12 ) 



<0 
> - 



1.1 - 1 .|> 



3 - 



Since det(p, g) is a multiple of det(T“^A) =1, the only possibilities are 
det (p, g) = — 1 



or 



det (p, g) = — 2 . 



In the first case, p, g are a basis. Suppose, if possible, that det (p, g) 

— — 2. Since p is primitive, there is a basis p, r, where det (p, r] 

= ± (A) = ± L Write 

' ' q—up+vr 



where u and v are integers. Then 

det (p, q) =v det (p, r) , 

^ For then |/o(q,)| ^|;fiP-|- |%l |^2|“+ explicitly give 

such trivial estimations later. 
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so j; = ±2. Now u must be odd, since q is primitive, and so 
<=i(p- 9 )€T-iA. 

But then, by (11) and (12), 

0<<i<1, — 1<<2<0; 

a trivial estimation shows that |/o(t)l<1. and so t would be in 
contrary to hypothesis. 

To sum up: there is a basis q^ 0 t of t'^A. The point 2 q — p 
lies in /„ (a?) g — 1 . There are no other points of A in or 3 i. 

We must now translate our facts about t~'A into facts about A. 
We write 

The region is bounded by the curve 

Xl + Xl = i, 

the transform of f^{x) =1, and the line-segment joining the points 
t(0, — 1 )=jB, t(— 1, — 1) = — 4 -|- B, 

and so is roughly as shown in Fig. 7. The point 

Q (say) = rq 

lies in t^. 

Similarly is bounded by Z? + = 1 and the tangents at 

t( 0, — 1) =B and at t(1, 0) =A. We now show that x.^lies in 

0<Zi<1, 0<Z2<1. (13) 

Indeed, since B2>1, the tangent to Xl+Xl = i at B has negative 
gradient and so meets Z?+Z| = 1 again at a point in (13). Since 
lies below this tangent, its points satisfy Z2<1. Similarly, since Ai>i, 
the points of satisfy Xi<i. They clearly satisfy Zi>0, Z2>0. 

But now we saw earlier that there is only one point, P, of A in (I3). 
Since zp is in we must have 

xp =P. 

To sum up the results of our translation: there is precisely one point 
^6 A in x^. This point Q together with the unique point P of A in 
(13) form a basis for A. The point 2 Q — P lies in — 1. 

Let be the mirror image of x^ in A\ + Z2 = 0. By symmetry 
there is precisely one point L, say, of A in : this point together with 
— P forms a basis for A, and the point 2L-|-P lies in Xl-\-Xl^\. 

Cassels, Geometry of Numbers 7 
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But now every point of the triangle oLQ is in one of the regions 
y, and By hypothesis there is no point of A in 6^, and we 
proved that Q,L are the only points of A in respectively. 

Hence Q, L forms a basis of A by Lemma 6. 

We have three bases P, Q Q.L and L, —P for A and must study 
their relations. Now 

det (P, Q) = det {Q, L) = det [L, -P) =d {A), 
since the determinants are and are clearly positive. Write 

P — uQ V L. 



Then 

det(P, Q) =vdetfL, Q) , 
det (P,L) =«det(P, Q). 

Hence 

P = Q-L. 

We have now reached a contradiction, since 
2Q - P = 2L + P, 

and this point has been shown to lie both in and in 

— 1. The contradiction shows that there are no ,9^-admissible 
lattices with d{A)—A(, except those mentioned in the enunciation of 
the theorem. 

We have shown rather more. Let the line joining B and —A + B 
(which forms part of the boundary of t^) meet Xi + AT 2 = 0 in the 
point ( — c, c). Then it is clear that our argument shows that there is 
a point of every lattice A with if (A) ^ Ho in the bounded region 

\Xl + Xl\<i, max{iX,|, 

except when A is one of the two critical lattices. That is, +X|| < 1 
is boundedly reducible and indeed fully reducible in the sense of 
Chapter V, § 7- 

III. 7. Representation of integers by quadratic forms^. In this 
section we digress to present a number of results in the arithmetic 
theory of quadratic forms which can be proved very simply by the 
methods of the geometry of numbers. The principle tool is the following 
lemma. 

Lemma 9. Let n,m,k^, be positive integers and {\^i^m, 

he integers. The set A of points u with integral co-ordinates 



* This section is not used later. 
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satisfying the congruences^. 

2 =0 (*i) 

zs a lattice with the determinant 



That A is a lattice follows, for example, from Theorem VI. Two 
points u and v of the lattice Aq of all integral vectors are in the same 
class with respect to A if and only if 

Z a./M, = Z {k,) (1 g I ^ w) . 

i i 

Hence the index / of A in Aq, that is the number of classes, is at most 

nk„ so 

d{A)=Id{\)^Y]ki 

% 

(compare Lemma 1 of Chapter I). 

III. 7. 2. As a first example we show that every prime number 
^ = t ( 4 ) is the sum of the squares of two integers. For then, as is well 
known, there is an integer i such that 

-|- 1 = 0 ip). 

The set of integers (Mj , Mj) such that 

u.,= iu^ (P) (1) 



is, by Lemma 9, a lattice A of determinant d(A)^p. Hence by Min- 
kowski’s convex body Theorem II there is certainly a point of A in 

the disc ^22 

x\ + x\<2p 



of area V[Si) =2np>'2?i(N). Hence there are integers u^, not both 0 
satisfying (1) and 

2p . 



But (1) implies 

ul-\-ul= u\(\ + i^) =0 (p), 



and so u\-\-ul—p, as required. The method is readily extended to show 
that a positive integer is the sum of two squares provided that it is 
not divisible by a prime p= — 1(4). 

III.7.3. As a second example, we shall show that every positive 
integer m is of the shape 

m = u^ U2 -f* W'3 -j- W4 
1 By a~b {/i) we mean that a — 6 is divisible by k. 

7 * 
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with integral Ul,u^,u^,u^. We may suppose without loss of generality 
that m is not divisible by a square other than 1, so 

■>n=Pi---Pg, 

with distinct primes p^, ...,p^. We now show that to every prime p 
there exist integers Up , bp such that 

^* + ^^ + 1=0 (p) . (1) 

Indeed when p is odd the numbers 

[0^a<\p), (2) 

and 

(0^6<i^) (3) 

are each a set of + integers which are incongruent modulo p. 
Since there are only p classes modulo p, there must be some integer c 
which is congruent to an element of each set (2) and (3), that is 

ap = — 1 — so + + 1 = 0. If ^ = 2, then = t, h^=0 will do. 

We now consider (cf. Davenport 1947b) the lattice of integral 
M = («i, which satisfy the 2g congruences 

u.^ = UpU^ + hpU^ (/>) I 

^i^bpU^ ^^^4 (p) J 

iov p=Pi, ...,pg. By Lemma 9, these form a lattice A of determinant 
d (A) ^p\...p'\ = nfi. 

Hence there is a lattice-point other than o in the set 
xl + xl + xl + xl< 2m 

of volume 

^7t^(2w)^ > 2^m^ 2*d(A) . 

If u is this point, then 

0 < «i -b -f «3 -|- «4 < 2m 

and, by (1) and (4), 

-f- «2 + -|- «4 = {up -f ip -b 1 ) «3 (flp -b ip + f) “4 = 0 ip) 

iov p=pi, ...,pg] that is m divides Mi -1 +u\. This proves the result. 

III. 7.4. A famous theorem of Legendre states that a ternary 
quadratic form f[x^, x^, x^) with rational coefficients represents 0 if 
obviously necessary congruence conditions are satisfied. Following 
Davenport and Marshall Hall (1948a) and Mordell (1951a) we 
verify this in a particular case, to which indeed the general case may 
be reduced by simple arguments. 
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^ / / \ 2 2 2 

f{x) = + a^X2 + a^xi, 

where a^, a^, are square-free integers no two of which have a common 
factor, so is square-free. We show that there exist integers u=|=o 

such that /(u) =0 provided that the following two -conditions are 
satisfied 

(i) there are integers A^, A.^, A^ such that 

a^ + A\a 2 = 0 [a^) , = 0 («i) . + A\a^ = 0 [a^] 

and 

(ii) there are integers , 1^3 not all even such that 

ttj Vi -|- ^2 ^2 “h ^3 = 0 (2^'*''*), 

where X = \ or 0 according as 01^2^3 is even or odd. 

Let 

1 « 1 « 2 « 3 | =' 2 .^Pi---Pi 

where p^, ■■■,pg are distinct odd primes and X = i or 0. We shall take 
for A the integral vectors u = (mj , «2 . *<3) satisfying the following con- 
gruence conditions. 

(I) Let p be one oi p^, ...,p^. By symmetry we may suppose that 
^3 = 0 {p). We impose the condition 

i<2 = -dsMi {p ) . 

Then 

a^u\ -b fl2«2 + « 3«3 = + «2«2 = («1 + « 2 ^ 3 )Mi = 0 [p] . 

(II J Suppose A = 0, so a^,a2,a,2 are all odd. Now 

= 0 or 1 [ 2 ^) 

for any integer v. In condition (ii) precisely one of v^,V2,V2 must be 
even, say Vg. Then 

0 = «iV? + a2i'l + <*3*^3 = «1 + «2 (2^)- 

We impose the two congruences 

= «2 ( 2 ) , 1 
«3 = 0 (2) . I 

Then 22299 

aiMl + a 2 « 2 -l-a 3 M 3 = aiMl-|-fl2M2 = 0 (2=*). 

(II^) Suppose A = 1, so one of a^, a^, is even, say a^. Then 
aii^i +a2v\ is even, so i;j, V2 are both even or both odd. If v^, V2 were 
even then „ „ „ 

« 3«^2 = — — « 2*^2 
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would be divisible by 2^, so would be even. Hence in (ii) must 
be odd, and 

0 = aivl + a 2 v\ + a^vl = a^-\-a^+ a^v\, (2*) 

since (2®) if v is odd. We impose the two conditions 

Ml = Mj (2®) , 

% — ^3 ^1 ( 2 ) • 

Then it is readily verified that 

fliMf + MaMa + MaMf = 0. (2®) 

In any case the lattice A of integers u has determinant 

d(A) ^ 2^+^pi-..pg = 4|Mi«2a3| , 

and the congruence conditions imply that 

ajM? + MjmI + AjmI = 0 (mod 2^+^pi...pg = 4|mi«2^3|) • 

But now, by Minkowski’s convex body theorem, there is a lattice 
point not o in the elhpsoid 

1m]| X? + |a2l ^2 + l«3l ^3< 4lai«2«3l 

of volume 

V{(^) = Y ■ 2®|ai«2‘*3| > 2® if (A). 

If M=|=o is the lattice point in we must have MimI +« 2 W 2 + « 3 m| =0, 
since it is divisible by 4aiM2'*3i ^^d 

|«iMf + a2«2 + a3«3| ^ + \ h \'^\ + |«3|*<3<4|aia2a3l. 

We conclude with a couple of remarks. An obviously necessary 
condition for solubility of HiM® d-agwi + « 3 M 3 =0 is that Oi, Uj- ^3 should 
not all have the same sign. We did not use this at all. Hence this 
condition must be derivable from the others. The reader might verify 
that this can be done by means of the law of quadratic reciprocity. 

In the second place we have not merely shown the existence of a 
solution, but we have shown that there is one which satisfies 

\aj\u\ + \ai\u\ + \a^\ul<A\aia^a3\. 

The right-hand side here may be improved to 2^|uia2«3| by the use 
of the precise Theorem III of Chapter II instead of Theorem II, as the 
reader can easily verify. 
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Chapter IV 

Distance-Functions 

IV. 1. Introduction. In this chapter we introduce a number of 
concepts which are useful tools in all that follows. 

IV.1.2. A distance-function F{x) of variable vector x is any function 
which is 

(i) non-negative, i.e. F(x)^0, 

(ii) continuous, 

and 

(iii) has the homogeneity-property that 

F(tx) = tF{x) {t ^ 0, real) . 

The set defined by 

F(x)<\ (1) 

turns out to be a star-body in the sense already introduced in the last 
chapter ; that is, the origin o is an inner point of and a radius vector 

tx^ (0^t<oo) 

either hes entirely in [which happens when F{x^) =0] or there is a 
real number fo ={-^(®o))'^>0 such that tx„ is an interior point of, a 
boundary point of or outside of according as t<tQ, t = t^ or t>iQ. 
In § 2 we examine this relationship and show that conversely every 
star-body 6^ determines a distance function F{x) such that the set (1) 
is the set of interior points of Since many, though not all, of the 
point-sets of interest in the geometry of numbers are star-bodies, the 
concept of distance-function plays an important role. 

Most of the problems considered in Chapter II relate to star-bodies; 
and then it is easy to write down the corresponding distance functions. 
For example if f{x) is a positive definite or semi-definite^ form, the set 

f{x)<\ 

corresponds to the distance-function 

F[x) = {f{x)f', 

where r is the degree of f[x). Again, if f{x) is an indefinite form of 
degree r and ^ > 0 is a number, then the set 

— \ <f{x) <k 



1 By semi-definite we mean that f(x) ^0 for all x but /(x) = 0 for some x#=o. 
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corresponds to the distance function 

1 |/(*)|i/' if /(a;)^0. 

The reader will readily verify that both the functions just defined 
are in fact distance-functions. One advantage of introducing distance- 
functions is that some of the ideas of Chapter II can be carried over 
to all star-bodies. A simple example of a 2-dimensional set which is 
not a star-body is 

0 < Xa < 1 . 

Clearly star-bodies if which are symmetric, i.e. have the property 
that —xiS^ when x<^ 9 ’ correspond to distance-functions which are 
symmetric in the sense that 



F{-x) =F{x). 

K. Mahler (t950a) and C. A. Rogers (1952a) have investigated a 
wider class of sets which Rogers calls star-sets and which include the 
closed star-bodies as a sub-class. A star-set is a closed set such that 
txeS^ whenever O^^^l and x^SF. They are important in connection 
with certain problems (“bounded reducibility” cf. Chapter V, § 7) and 
we shall mention them again; but we refer the reader to the original 
memoirs for the details. 

IV.1.3. Convex sets JT are important as Minkowski’s convex-body 
theorem shows. It turns out that the convex sets which have the origin 
o as an interior point are precisely the star-bodies whose distance- 
function satisfies the inequality 



F(x + y)^F(x) +F{y). 

This we prove in § 3. We call such distance-functions convex. 

In § 4 we show that an w-dimensional convex set Jf has a tac- 
(hyper)plane^ at every point a on the boundary of JJf; that is a 
(hyper) plane 

it; hx^ + --- +p„x„ = k 



which passes through a and is such that JT hes entirely on one side 
of or in tt; say 

PlXl^ \-p„x„^k (all JT) . 



Clearly if there is a tangent plane to Of at a, then it is the only tac- 
plane. But tac-planes exist even when tangent planes do not, and they do 



1 We use the words tac-plane and plane for tac-hyperplane and hyperplane. 
When M = 2 the corresponding thing is called a tac-line. The term supportplane 
(German: Stiitzebene) is sometimes used. 




General distance-functions 



105 



not need to be unique: for example when a is a corner of the square 
lXil<t, 1X2|<1. 

In discussing tac-planes it is convenient to introduce the polar 
body of a convex body; a notion which we shall in any case require 
in Chapters VIII and XL Any plane rr not passing through the origin 
can be put in the form 

•n-: yiXi-i hy„x„ = 1, 

and so may be represented as a point y = (yi, y„) in «-dimensional 
space. It turns out that the points y corresponding to planes rr which 
do not meeti JT themselves form a convex set Jf*, say, the polar of Jf. 
Further, the relationship between Jf and Jf* is reciprocal in the sense 
that Jf may be obtained^ from Jf* in the same way as Jf* was obtained 
from jT. 

An example of a pair of polar bodies are the generalized cube 

max|xy|^l 

and the generalized octahedron 

It is easy to see that a plane Y^y^x— 1 for fixed y can contain a point 
of the interior of ^ only if y is not in and vice versa. We discuss 
polar sets in § 4. 

There is a rich theory of convex sets but we do not prove more than 
is relevant to the geometry of numbers. For the rest the reader is 
referred to the report of Bonnesen and Fenchel (1934a) or Eggle- 
ston’s tract (1958a). 

IV. 2. General distance-functions. We set up now the relationship 
between distance functions and star-bodies sketched in §1.2. 

Theorem I. A. If F(x) is any distance function then the set 

SF-. F(a;)<1 

is an open star-body. The boundary of is the set of points x with 
F{x) = 1 and points with F{x)>i are exterior to ( that is, have a neigh- 

bourhood which does not meet SF). 

B. Conversely any star-body determines a unique distance-function 
F(x). If ^ is the set of interior points of ^ then If is related to F[x) 
in the way described in A. 

1 We say that two point-sets meet if they have a point or points in common. 
* Strictly speaking the set JT** obtained from jf* coincides with JT except 
possibly on the boundary. The distance-functions of and Jf** are thus the same. 
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We note first that two distinct star-bodies ^ and determine the 
same distance-function ¥[x) if they have the same set of interior points, 
but a distance-function defines precisely one open star-body, namely 
F{x) <1 and one closed star-body, namely F(*)g 1 . Distinct distance- 
functions F, , Fj always determine distinct star-bodies. For then F^(Xq) 4 = 
Fj (Xq) for some Xq , say F^ {x„) < Fj (Xq) ; and then there is a t such that 

[t X(y) < 1 < (i Xq) , 

SO tx^ is in one star-body but not the other. 

The proof of Part A of the theorem is nearly trivial. If F(aCo)< 1 , 
then, by the continuity of F(x), there is a neighbourhood 

\x — a^ol <?y 

of Xq which lies in y”; so x^ is an interior point of Here we have used 
the standard notation 

1*1 = -I +xl)^- 

Similarly, if F(Xo)> 1 , then there is a neighbourhood of Xq which does 
not meet 6^. Finally, if F(Xo)=t, then every neighbourhood of Xq 
contains points tXg both with f>l and t<i, for which F(ix„)>\, 
F{iXg)<\ respectively: and so x^ is a boundary point of 

It remains to prove B. If y* is any star-body, we define a function 
F(x) as follows: 

(a) F(x) =0 if tx€^ for all t>0. In particular F(o) = 0 . 

(/?) If tx is not in ^ for all t>0 then, by the definition of a star- 
body, there is a t„=tg{x)>0 such that tx is interior to or exterior to y 
according as t<tg or t>io', and t^x is on the boundary of y. We put 

F(x) ={^,(x)}-i. 

Clearly, if F(x) is a distance function, then it is related, to the set y of 
interior points of y in the way described. Further, it follows trivially 
from the construction of F(x) that it satisfies two of the defining pro- 
perties of a distance function, namely F(x )^0 and F(tx) =tF(x) for 
all t>0. It remains only to show that F(x) is continuous. 

We show first that F(x) is continuous at o. By the definition of a 
star-body, the origin o is an inner point of y , so there is an rj>0 such 

that the sphere . , ^ 

|x|^?7 

is contained in y. Hence, if Xq=1=o, the vector 

t Xq, t = T]l\ XqI 

is certainly in y, so 

F(Xo)^yi|xo|. ( 1 ) 
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Since r] is independent of Xq, this proves the continuity of F{x) at the 
origin. 

We now prove continuity at a point Let e>.0 be arbitrarily 

small. The point 

^i = {^(*o) +er‘»o (2) 

is an interior point of ^ by the definition of F; and so there is a 
neighbourhood 

(3) 

which lies in that is, (3) imphes 

F(x) ^ 1 . (4) 

Write 

= {F{Xo) + Vi = {^(*o) + ey^Vi- 

Then (3) is equivalent to 

|y-»o|<>?2 (5) 

and, by the homogeneity property of F{x), which we have already 
proved, the inequality (4) is equivalent to 

F{y)^F{x^) + e. (6) 

We have thus found a neighbourhood (5) of x^ in which (6) holds. 
It remains to find a neighbourhood of points y in which 

F{y)^F{Xo) - (7) 

If F(Xq)-^s, then (7) is true for all y, since F{y)^0. Otherwise one 
considers the point 

x^ = {F(Xo) - e}-^x^. 

This is an exterior point of ^ and then the argument goes as before. 
This completes the proof of the theorem. 

There is the trivial corollary of which we leave the proof to the 
reader. 

Corollary. Let Fj(x) and F^lx) be distance functions. The star-body 
73 (a;) < 1 is a subset of the star-body F^ {x) < 1 if and only if 

Fyx)^Fyx) ( 8 ) 

for all X. 

We record for later reference two results, the first of which we have 
already proved. 

Lemma 1 . For every distance-function F(x) there is a constant C such 

that 
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Lemma 2. A necessary and sufficient condition that the star-body 
m <\ be bounded is that F(x)4=0 if aj=t=o. There is then a constant 
OO such that 

F(ac)^c|a:l (9) 

for all X. 

We proved Lemma 1 above with C = at least when ac=j=o; and 
it is trivial when x=o. If there is a x^=^o with F[x^ =0 then tx^ 
lies in F(ac)<1 for all t>0, so F{x)<i cannot be bounded; which 
proves half of Lemma 2. Suppose conversely that F(»)4=0 if x=^o. 
The function F{x) is continuous on the surface of the sphere |a;| =1 ; 
and so attains its minimum, say, at x^. Then F(a;o)>0, by hypothesis. 
Put F{x^)=c. Then F{Xo)^c if |a;|=1; and so (9) holds by homo- 
geneity. This completes the proof of Lemma 2. We note that if (9) 
holds, then F(x)<t is entirely in the sphere |a;|<c'^. 

The following trivial corollary rids Lemma 1 of its dependence on 
the particular distance-function |ar|. 

Corollary. Let be distance- functions and let 

be a bounded set ( i.e. F^(x) =0 only for x = o). Then there is a constant C 
such that 

Fi[x) ^ CFi(x) 

for all X. 

If, further, F,^{x)<\ is bounded, then there is a OO such that 
CFi(x) ^ F,^[x) ^ cFi(a:) . 

The second part of the corollary may be picturesquely summed up 
m the slogan "for quahtative purposes there is only one bounded 
star-body”. 

IV.3. Convex sets. A set JT is convex if 



tx-\-(\~t)y (0<i<1) (1) 

is in whenever x and y are in JT. It is said to be strictly convex 
if the points (1) are all interior points of Jf. 

We show first that if x^, ...,x^ are any points of JT and 



then 



Xj T ‘ tf Xf € . 



( 2 ) 



This is true for r = 2 by the definition of convexity, and it is true for 
y>2 by induction, since we may suppose that <i=|=l, and then 



qxi H h t,x, = /jXi +[^-k)y, 
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where 

M = —X, H + — ~X.€Jf, 

since it is of the shape (2) with r — \ summands. 

Almost immediate consequences are 

Lemma 3- A convex set Of in n-dimensional space either lies entirely 
in a hyperplane 

tt: piXi + --- +p„x„ = k 
or it has interior points. 

Lemma 4. A convex setJf with a volume L'(Jf) such that 0<V{Jf) < oo 
is bounded. 

For if jT does not he in a hyperplane it contains « + 1 points 

which do not lie in a hyperplane. The points 2^/®/ with — ^ 

are just the points of the simplex with vertices x^, The 

whole simplex must be contained in X', and since a simplex has interior 
points this proves Lemma}. 

In Lemma 4, we note that X cannot he in a hyperplane if F(JT) >0; 
so we may suppose without loss of generahty after a change of origin 
that o is an interior point of JT. There is then a number rj>0 such 
that ah the vectors 

j-i 

rje^ = (0, . . . ,0,rj, 0, ... ,0) (i ^ ^ n) 
are in X~. If a = (a^, ..., a„) be any other point of JT, we shall show that 
maxla,! ViX). 

l£;Sn ' 

If, say, ai=t=0, then the whole of the simplex with vertices o,a,rie^, 
...,t]e„ is contained in X and has volume 

Since this can be at most V(X), the result foUows. 

Finally we prove 

Theorem II. A convex body X of which o is an interior point is a 
star-body. The corresponding distance function F{x) satisfies the inequality 

F{x + y)^F{x)+F[y) (3) 

for all X and y. 

Conversely if F{x) is a distance function for which (3) holds, then the 
star -body 

F{x)<\ (4) 



ts convex. 




no 
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The converse is trivial. If F{x)<\, Fiy) <1 and 0<f<1, then the 
inequality (3) applied to tx and [\ — t)y gives 

F{tx + {\ -t)y}^F(tx) -f T'ill -t)y) 

= tF[x) F - t)F{y) 

<t + {\~t) 

= t. 



It remains, then, only to verify the direct assertion of Theorem II. 
We define a function ^(£15) as follows: 

F{x) = inf (5) 

where the infimum is taken over all t such that 



f>o, txeJT. 



( 6 ) 



Since o is an interior point of Jf, there certainly do exist t satisfying (6). 
It follows at once from the definition that F[x)^Q, F(o) =0; and that 
F{sx) =sF{x) for all s^O. Thus F{x) will be a distance-function if 
we can prove continuity. We first prove the functional inequality (3) 
and then deduce continuity from (3). 

Let x,yhe any two vectors and s, t any two positive numbers such that 



Then 



sxeJF, tyeJT. 
rsx -f (1 — r) tyeJf 



( 7 ) 



if 0<r< 1. 
Then 

so 

Hence 



We choose r so that this point is multiple of x-\-y, Le. 
rs~H—r)t; r = tl{s-\-t). 

^(a; + j/)6Jr ; 

F[x + y) g ~~ = s-i -f rb 

F{x + y)^F[x) +F[y) 



since F[x),F{y) are the infima of over s,t respectively which 

satisfy (7). 

The function F[x) is continuous at o by the same argument as was 
used for distance functions. Since o is an interior point there is a neigh- 
bourhood , , 

\x\^rj, rj> 0 

of o contained in Jf, and so 




Convex sets 



111 



The continuity at a general point is now immediate. We have 
F{Xa + y)^F(Xo) +F(y), 

and 

F{Xo)^F(Xo + y) +F(-y). 

Hence 

\F[^o + y) -F(Xa)\^maxF(±y)^ri^^y\ < e 

for any given £>0, provided that \y\<r]e. 

Finally we must verify that the set 

F{x) < 1 

is in fact the set of interior points of Jf. A point x with F{x)<i is 
certainly in JT since, by the definition of F{x), there is a f> 1 such that 
txeJf; and so 

X = (tx) + (1 — o 

is in by convexity. Since F is continuous, the set F(x)<z\ is open; 
and so all its points are inner points of JT. Conversely, if x is an inner 
point of Jf, there is a f>1 such that txiJf, and so F(x)<i by the 
definition of F{x). From the definition of F, no point x with F{x)> 1 
can belong to JT. Points with F(x) = 1 may or may not belong to 
but, since F[x) is a distance-function, they must be boundary-points 
of jr. 

For later reference we enunciate formally a result we have just 
proved ; 

Corollary. Let F{x) be a non-negative function of the vector x which 
satisfies the two conditions 

F{tx)=tF{x) if t>Q, 

F(x + y)^F[x) +F{y), 

and which is continuous at o. Then F(x) is continuous for all X] and so 
is a distance- function. 

IV. 3.2. The next lemma is an essential preliminary to the treat- 
ment of polar bodies and tac-planes. 

Lemma 5^. Let be a closed convex sets having no point in 

common. Then there is a hyperplane 

TT : P\X^-\- ■■■-{- p^x^ = k 

which separates and jCj : that is all the points of Jfj are on the opposite 
side of TT from those of JT,. 

Proof given is valid only if at least one of jTj, is closed (as otherwise there 
need be no minimum distancej. 
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Consider the distance |a5i— aigl when run through the points 

of respectively. Since and are closed, this distance 

attains its infinum at some points x'jeJfj {j = 2); and x'i^x '2 since 
Jfi and .^2 have no points in common. We show that the hyperplane it 
which bisects perpendicularly the Une-segment Xix '2 will do what was 
required. After a suitable rotation of the co-ordinate system and a 
possible change of origin we may suppose that 

x'i = ...,0) , X 2 = {rj,0, ...,0) 

for some rj>0. The plane it is then 

ir; %i = 0. 

Suppose, if possible, that there is a point z in with 2l^0. By con- 
vexity, the point 

Zt = (\ — t)x[ + tz (0<<<1) 

is in The distance |«(— is given by 

= 4??“* — 4{r] -h 2 i) f -b 0 {fi) < Arf, 

if t is small enough and strictly positive. This contradicts the definition 
of x'l and x' 2 . The contradiction shows that z cannot in fact exist, and 
so proves the lemma. 

Corollary. // jT is a convex closed set and a a point not in jT, 
there is a hyperplane separating Jf and a. 

For we may put and take 3^2 fo he the set consisting of a 

alone. 

IV.3.3. In introducing the polar set of a given convex set JT, we 
confine attention to the case when Jf is bounded and can be described 
by a distance function; that is o is an inner point and 0<F(JT)<oo 
by Lemmas 3 and 4 and Theorem II. If the reader is interested he will 
have no difficulty in extending the results to the other cases using 
Lemma 2. 

We write 

P«=A«i-l !-/>„«„ 

for the scalar product of two vectors p and a. 

Theorem III. Let F(x) be the distance-function associated with a 
bounded convex set. For all vectors y let 



( 1 ) 
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Then F*{y) is the distance-function associated with a bounded convex set. 
The relationship is reciprocal in the sense that 



F[x) = sup 
w + o 



F*{y) • 



( 2 ) 



The functions F and F*, or the convex sets associated with them, 
are said to be polar to each other. 

We must first show that F* is well-defined. Since the body F(a5) < 1 
is bounded, we have F(£c) 4= 0 if a: #= o by Lemma 2, and indeed there is a 
constant OO such that F(x)^c\x\. Since xy^\x\\y\, it follows that 

F*(y)^c-i|y|. ( 3 ) 



Immediate consequences of the definition are that 
F*(iy) — tF*{y) if <>0, 

and 



F*(y)>o if y + o. 



( 4 ) 

( 5 ) 



Now if i/j, 1/2 are any vectors, we have 

f •(». + »,) - »P S sup Hi + sup ^ 

= F*(i/i) +F*{y^). 



( 6 ) 



But now (3), (4), ( 5 ) and (6) show that F*(y) is the distance-function 
of a convex set, by Theorem II and its Corollary. This convex set is 
bounded because of ( 5 ) and Lemma 2 . 

It remains only to prove (2); and here we need the convexity of 
F{x), which we have not yet seriously used. If a: =0, then (2) is trivial, 
so let aio 40 be fixed. From (1) we have 

F(x)F*{y)^xy (7) 

for all X and y : and so certainly 

^W^sup3^. (8) 

Let e> 0. Then by Lemma 5 Corollary there is a hyperplane tr separating 
aij from the set of x such that 

F(a:) ^ (I - £)F(»o). (9) 



Since tt does not pass through the origin, it may be written in the shape 

tt : xyo = \. ( 10 ) 

Then F{x)^{i — e)F{x^) for all points x on tt, since tt does not meet 



(9) ; hence 



F*{yo) ^ 



1 

(1 -«)F(a:„) ’ 



( 11 ) 
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since one need clearly only consider the x with xy = 1 in (1), by homo- 
geneity, if 1/ =i= o. Further, 

(f2) 

since x „ is on the other side of it from the origin, which is a point of (9). 
From (11) and (12) we have 

The required result (2) now follows from (8) and (13), since e is arbi- 
trarily small. 

This concludes the proof of the theorem. The reader will be able 
to verify readily that the sets F(a;)<l and F*{y)<i are related to 
each other in the way described in § 1.3- 
We have at once the 
Corollary 1 

F{x) F*(y)^xy 

for all X, y For any i/o=t= o there is an a;(,=)= o such that 

P(Xo)F*{yo) =x„yo', (14) 

and vice versa. 

We have already noted the first inequahty, which is an immediate 
consequence of the definition. By symmetry it is enough to show the 
existence of x^, given y„. The set ^ of points x with F{x)—\ is 
bounded; and it is closed since F(ac) is continuous. Hence the continuous 
function xy^ attains its upper bound, say at ajo- But we have already 
seen that the upper bound is F*(i/o). so (14) must hold. 

We also shall need later 

Corollary 2. Let convex sets with non-zero volume having 

the origin as inner point and with respective polar s Jf* ahd . If ..tTi 
contains Jfj then contains 

Let the corresponding distance functions be F^[x), F.^{x), F*{x), 
F*{x). Then F2 (®) ^ Fi (ac) by Theorem I Corollary. The definition (1) of 
the polar distance-function then gives immediately F2*(i/) ^F*{y) for ally. 

The following corollary links polar distance-functions with the polar 
lattices and transformations introduced in Chapter I, § V. 

Corollary 3. Let F[x),F*(y) be a pair of mutually polar convex 
distance- functions. Let x be a homogeneous linear transformation and t* 
its polar transformation. Then F{xx) and F*{x*y) are mutually polar. 

For by the definition of t* we have xxx*y —xy for all x, y. The 
truth of the corollary now follows from (1) and (2). 
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IV.3.4. A hyperplane tr through a point on the boundary of a 
convex set Ot is said to be a tac-plane to at x„ if no interior point 
of JT is in TT. The following Theorem IV is an almost immediate con- 
sequence of the results of § 3 . 3- We shall need Theorem IV in the next 
chapter, but §3-3 only in Chapter VIII. 

Theorem IV. Lei JT be any convex body with volume F(JT) such 
that 0<V{Jf)<oo. Then at every point Xq on the boundary of Cf there 
is at least one tac-plane. There are precisely two tac-planes to parallel 
to any given hyperplane tt. 

We may suppose that o is an interior point of Let F{x) be the 
corresponding distance function. Then T'(®o)=L By Corollary! to 
Theorem III there is a J/o=t=o with 

* 0 1/0 = -P («o) P* (j/o) = -P* (?/o) ■ (1 ) 

The plane 

•n'; xyo=F*{yo) (2) 

thus passes through x^. By the Corollary ! to Theorem III we have 

xy„^F{x) F*{yo), 

so F{x)^\ for all points of it'. Hence it' contains no interior point 
of jT, so is a tac-plane. 

Any plane (2) for fixed y^ is a tac-plane at some point x^^. For by 
Corollary 1 to Theorem III there is an x^ such that (!) holds. 

Hence if y^ is any vector, the two planes 

xyo = p*{yo) (3) 

and 

^ya = -F*[-y^) (4) 

are both tac-planes. It is clear that they are the only tac-planes parallel 
to xy^=0. The origin lies between the hyperplanes ( 3 ) and (4), and 
hence so does the whole of the interior of JT. 

IV. 3.5. In Chapter IX we shall need the following result. 

Lemma 6. Let and Jfj be open convex sets in n-dimensional space 
with 

0<F(Jf))<oo (/ = !,2). 

Suppose that and JTj have no points in common but that a is a boundary 
point of both and Then there is a hyper plane through a which 
does not meet either JTj or JTj {and so is a tac-plane to both and Jf^). 

The proof follows that of Theorem HI. We may suppose without 
loss of generality that o is an inner point of Let h be an inner 

8 * 
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point of ■ Then and JTg may be described by distance-functions : 
Jjfj : (ac — 6) < 1 . 

For J<<<! let (/ = !, 2) be given by 

so that :X~‘ is a closed subset of By Lemma 5 there is a plane iri 
separating JT/ and Since -rr* does not pass through the point 

0 € it has an equation 

2 

Since t>\, the set C^l contains a neighbourhood of the origin, 

where rj>0. Since no points of this neighbourhood lie on iri, we have 

\pj,\^rj-^ ( 1 ) 

Since b is on the opposite side of Td from o, we have 

ZPi,bi>i. ( 2 ) 

1S;S» 

By (1) and Weierstrass’s compactness theorem, there exist p] which 
are the limits of as t tends to 1 through a sequence of values 
k<k< --<^m<--- which is the same for each j. By (2) not all the p] 
are 0. The plane tt' defined by 

2 P'i = 1 

i 

clearly has all the properties required. 

IV.3.6. The results of the rest of this § 3 will not be required until 
Chapter VIII, but it is convenient to give them here. They show that 
any two symmetric convex sets and jTj with finite non-zero volumes 
behave similarly. 

For more precise results, generahsations to convex sets which are 
not symmetric, and references to the Hterature, see for example Bambah 
(1955a), and for an interesting application see Mahler (1955 a, b). 

A closed “generahzed parallelopiped” in «-dimensional space with 
o as centre is the set of all points 

X=t^Xi-\ \-t„x„ (1) 

where x^, ...,x„ are fixed linearly independent vectors and t„ 

run through, all real numbers in 

max |/y| ^ 1 . 



( 2 ) 
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A closed “generalised octahedron” with o as centre is similarly the set 
of all vectors (1), where run through all numbers in 

( 3 ) 

We first prove the following refinement* of a result of Mahler 
(1939b). 

Theorem V. Let JT be any closed symmetric convex set with volume 
V ( jT) such that 0 < F ( Jf) < oo . T hen there exist points ± , . . . , i a;„ € JT 
such that is contained in the parallelopiped with faces tCj (1 ^ J'£n), 
where tCj is the hyperplane through the points Xj±_Xj (/#= /). Further, 
the generalized octahedron 2) with vertices {\^j'^n) is contained 
in JT. 

The last sentence is in any case trivial by convexity. We take for 
aCj , ...,Xn points of JT such that the volume of ^ is a maximum. Such 
a choice is possible since is closed and bounded. If JT were not 
contained in 'S, there would be a point y on the opposite side of the origin 
from one of the faces difLy, say on the opposite side of tt„. Then the 
generalized octahedron with vertices ±£Ci , . . . , ±®»-i . ± !/ would have 
greater volume than 2, contrary to construction. 

Corollary l. 

F(jr)^F(^)^(«!)-*F(jr). 

For the left-hand inequalities are trivial, and the right-hand ones 
follow from them and F(‘^)^«! V(2). 

Corollary 2. Let JT, he any two closed symmetric convex sets of 
finite non-zero volume. Then there is a homogeneous linear transforma- 
tion T of the variables such that 

and 

(«!)-* F(r) g V{xLF)^[n\) 

Let Xi, ...,x„ be the points of the theorem for Of and let y^, 
be the corresponding points for We determine t by the equations 

Ty^ = x, (1^/^m). 

Then the 'g’, 2 of the theorem are the same for and x.ST. The stated 
results are now trivial, since 

* Suggested by Professor C. A, Rogers, who disclaims originality. The same 
method proves a corresponding result for non-symmetric bodies in which the 
inscribed and circumscribed bodies are both simplexes (cf. Mahler 1950 a). There 
are also results about inscribed and circumscribed ellipsoids (John 1948a). 
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IV.3.7. As an application of the methods and results of § 3.6 we 
prove the following result about the volumes and lattice constants 
(Chapter III, § 5-1) of polar convex sets. We again denote the lattice 
constant of a set Sf by d(^). 

Theorem VI. Let and, Cf* be bounded symmetrical convex sets 
which are mutually polar. Then 

^ ^ V[X) V(Jf*) ^ 4" 

and 

02^ zi(jr) d(jr*) ^ t . 

where 0 is the lattice constant of the octahedron 2 

The first pair of inequality is Mahler’s (1939a, b) and the proof 
of the second pair is practically identical. When n~2 Mahler (1948a) 
has determined the best possible inequalities namely 

equahty on the left-hand side being necessary when is a square and 
on the right when Jf is a circle. For related inequalities and references 
to later work see Bambah (1954c and 1955a). 

We now prove the theorem for the lattice constants. The proof for 
the volumes is similar. Let t be any homogeneous hnear transformation 
and T* its polar transformation, so 

det (t) det (t*) = 1 . (1) 

The bodies tJT, are mutually polar by Theorem III, Corollary 3, 

Since 

d(Tjr) =idet(T)|zJ(jr), 

it follows from (1) that 

zi(Tjr) A(t*jt*) = A{jr) A(jf*) . 

Hence neither the hypotheses nor the conclusion of the theorem are 
affected if is subjected to a homogeneous hnear transformation and 
Jf* to the polar transformation. 

Suppose first that = Jfj, is a parallelopiped. After the apphcation 
of a suitable homogeneous linear transformation we may suppose 
without loss of generahty that is the unit cube 

1^,1 ^ 1 (1 ^ / ^ «) • 

We saw already in §1.3 that Jf* is the generalized octahedron 
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AW AW) = AW) = 0 ( 2 ) 

by the definition of 0. 

Now consider a general Jf, which we may suppose without loss of 
generahty to be closed. Let and ^ be the parallelepiped and octa- 
hedron given by Theorem V so that 

(3) 

The polar of the parallelepiped ^ is an octahedron which is inscribed 
in Jf* by Theorem III, Corollary 2. Similarly the polar of the octa- 
hedron ^ is a parallelepiped and 

(4) 

We now show that 

A{S!)^0A{’^). (5) 

where 0 is given in the enunciation. By Theorem V we have 

(«!)-! T(<^). (6) 

But every octahedron may be transformed into any other by a homo- 
geneous linear transformation, and so the ratio A{S>)IV{Si) is the same 
for all octahedra 3>. In particular, taking to be 2 |^,| < t. we have 

A(9) _ n\ 

T'iW) 2 "“ ■ 

Similarly, 

d(«’) _ 1 
F(«’) ~ ^ ’ 

and (5) follows from (6). 

But now from (3) and (4) we have 

zi{jr) /}(jr*) ^ zi(^) /}(<^*) ^0A(^) A[^*) = 0^, 
on applying (2) with Similarly 

AW) g A[^) A{2!*) ^ 0-^A(S>) A{9*) = 1 , 
on applying (2) with 

IV. 4. Distance-functions and lattices. In the further study of the 
relationship between star-bodies (and in particular convex bodies) and 
lattices it is convenient to work with distance-functions rather than the 
star-bodies themselves. We write 

F{N) =infF(a), 
as A 

0=^0 



( 1 ) 
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for any distance function F and lattice A. In the language of § 5-1 of 
Chapter III the lattice A is admissible for the star-body F{x)<k if 
k^F(/\) but not if k>F{A). 

We have 

f(fA) = |f|f(A) (2) 

for any real number f 4=0, where f A is the set of ta, o6 A. If t>0 this 
follows from the property F(tx) =tF{x) of distance-functions, and if 
t<0 from the further observation that A contains —a if it contains a. 
In particular, 

{F(fA)}’> _ {F(A)}« , , 

d{th) d{A) • 

where n is the dimension of the space. We sum up the properties of 
F(A) in the following theorem, which links our present point of view 
with that of § 5-1 of Chapter III. 

Theorem VII. For any distance function F write 

over all lattices A. Then d{F)<<x>. Further, 

d{F)={A(^)Y\ (5) 

where A{6T) is the lattice constant of the star-body 

ST \ F{x) < 1 . 

If A[6^) = <x>, then (5) is to be interpreted as d(F) =0. 

If d(F)=\=0, then the supremum in (4) may be confined to lattices 
A with F(A)>0, and then, by homogeneity, to those lattices with 
F{A) =\. Such lattices are admissible for by definition, and so they 
have d{A)^A (A) . This shows that 

d{F)^{A{y)r\ (6) 

On the other hand, if A is ,5^-admissible then F(A)^1, and since there 
are ,5^-admissible lattices A with d{A) arbitrarily close to A {ST), by the 
definition of A{ST), we must have 

d(F)^ sup {d{A))-^^{A(^)YT (7) 

A is ^-admissible 

Then (5) follows from (6) and (7). 

If d{F) =0, then F{A) —0 for every A. From (7) this can hold only 
if there are no .S^-admissible lattices, i.e. if A{ST) = oo. Conversely if 
A {ST) = oo and A is any lattice, the lattice tA is not ^-admissible, for 
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any t>0: 



tF(A) =F{tA) < i . 



Hence F(A) =0 on letting i-^oo. 
We note also the rather trivial 



Lemma 7. Suppose that the distance-function F{x) vanishes only for 
x=o. Then every lattice A contains a point a=|=o such that F(A) =F(a). 
In particular, F(A)>0. 

For by Lemma 2 there is then a number c>0 such that 

F{x] ^ c|*| . 

Hence 

F{x)^F[A)+\ (8) 

implies that 

l®|gc-i{F(A) +1}. (9) 

But now by Lemma 1 of Chapter III there are only a finite number 
of points of A for which |®| is less than a given bound, and so there 
are only a finite number of points x oi A satisfying (9). If we take 
a=ho to be one of those points for which F[a) is least, then a enjoys 
the properties required. 



Chapter V 

Mahler’s compactness theorem 

V.l. Introduction. So far we have been concerned with one lattice 
at a time. In this chapter we are concerned with properties of sets of 
lattices. We first must define what is meant by two lattices A and M 
being near to each other; and this is done by means of homogeneous 
linear transformations. A homogeneous linear transformation X = xx 
of «-dimensional euchdean space into itself is said to be near to identity 
transformation if the coefficients in 

are near those of the identity transformation, that is if 

|T,j — 1| {i^i^n) 

and 

I Tfj\ (1 ^ ^ M, i^j^n, 1 4= /) 

are all small. The lattice M is thought of as near to A if it is of the 
shape tA where t is near the identity transformation, and where tA 
denotes the set of to, a€ A. Roughly speaking, M is near to A if it can 
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be obtained from A by a small deformation of the underlying space. 
Convergence of a sequence of lattices A, to a lattice A' may then be 
defined in the obvious way. 

Minkowski (1904a and 1907a) already used the idea of the con- 
tinuous variation of lattices to show that a bounded convex set 

y-. F(x)<1, (1) 

where is the corresponding distance function, always has a critical 
lattice A^ in the sense of § 5-t of Chapter III; that is 

F{\) =^iniF{a)^\, ( 2 ) 

agAc 

and d (A,.) is a minimum ; 

dlA)=A(y]= inf i(A). 

A critical lattice has the property that if it is shghtly distorted to 
a lattice A with d{A)<d{A^) then F(A)<1 ; that is A has a point other 
than o in 6^. From this, Minkowski obtained important properties of 
critical lattices and so gave an explicit process for finding A(y) for 
convex bodies at least in 3-dimensional space. This was generahzed 
and put on a more satisfactory basis by Mahler (1946a), who gives 
general conditions under which a sequence of lattices A, should con- 
tain a convergent subsequence. In this way he showed that any star 
body F(ac) < 1 has a critical lattice if only there are any lattices A with 
F(A)>0. In an important sequence of papers, Mahler (1946a, b, c, d, e) 
extended much of Minkowski’s work on critical lattice to general 
star bodies and made other applications of his compactness criteria. 
He has also [Mahler (1949b)] considered the critical lattices of sets 
which are not star bodies, but we do not go into this here. 

In this chapter we first consider the properties of homogeneous 
hnear transformations which are needed for the treatment of convergence. 
Then we prove Mahler’s general criterion for a sequence of lattices A, 
to contain a convergent subsequence. After that, we study the pro- 
perties of critical lattices of sets taking in turn general star bodies, 
bounded star bodies, convex sets and spheres. As the sets become more 
specialized, there is more and more precise information about the critical 
lattices. Finally in § 10 we give an application to a problem in the 
theory of Dophantine approximation. 

V.2. Linear transformations. Convergence for lattices will be 
defined in terms of homogeneous linear transformations, already intro- 
duced in Chapter I, § 3. We operate in M-dimensional space with some 
fixed euclidean coordinate system. If r,y is a set of real numbers. 
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we denote by xx the transformation of our space into itself given by 
the equations 

where X=xx. We write det(T) =det(T,,). If det(x)=0, the trans- 
formation T is singular: otherwise it is non-singular and possesses an 
inverse, which we denote by x'h By o-fx, where a and x are trans- 
formations, we mean the transformation 

(o -l-x)x — ox -f xx; 

and by ox we mean the transformation 

(ox)x =o(xx). 

If o, X correspond to the matrices of coefficients and then the 
coefficients of o-fx and ox are clearly 

and 

'L(^ik-Cki. (I) 

respectively. We denote the identical transformation 
Xi = Xi (1 S g n) 

by i. 

We require a measure of the size of the coefficients of the matrix 
of a transformation x. We write 



Clearly 



Further, 



|lx|| = wmax 
o+x||^||ol| +||x 

ll®'r|| ^llo||||f|| 



( 2 ) 

( 3 ) 



since the coefficients of ox are given by (1). Further, if X = xx we 
have trivially 

max |Z,| g ||x|| max |x;| . (4) 



From this it follows crudely that 



since 



|xx| g «^||t|| |x| , 
max I I ^ I X I ^ max I x,- 

i i 



( 5 ) 



for all X. 
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We shall also need to use the fact that if t is near to the identical 
transformation i. then exists and is also near to i. This statement 
is made more precise in the following lemma. 

Lemma 1 . Let t = i + o he a homogeneous linear transformation with 



Then x is nonsingular and 
satisfies 

We note first that if p 
from (2) and (3). We have 



||o|| < 1 . 


(6) 


P = 1 — T“^ 


(7) 


Pll< """ 

Pll= 1-HI ■ 


(8) 



exists, the inequality (8) follows at once 



p = T = o — po; 

so 

||p1I^||o| 1 +||po|l^|la|| +l|p||||(x||, 

as required. 

It remains to show that x is nonsingular; and for this it is con- 
venient to use another characterization of 1 |t||. Put 

Fi(x) =n~'^'Z\Xj\, F 2 (aj) = max|x,| . (9) 

; ' 



Then (x) and F^ (ae) are convex symmetric distance-functions vanishing 
only at o, and 

F^{x)^F^(x) ( 10 ) 



for all X. Then clearly 



I"" "S w 



( 11 ) 



for all homogeneous transformations t. For any x we have, by (10), 
(11), that 

F^{x) = F^ {xx — 005) g Fi (xx) -f Fj [ax) 

^Fi{xx) +F^(ax)^Fi{xx) -f ||o||Fi(a;), 



the last hne by (11) with a for t. In particular, since ||o|]<.l by hypo- 
thesis, we have {xx) = 0 only when x = o: that is xx=o only when 
x=o: so T is nonsingular. This concludes the proof. 



Our choice of IItH to represent the “size” of t is somewhat arbitrary. If F 
is the distance-function of a symmetric convex bounded body, an alternative would 
be to use 

IMIf=sup^^^. (12) 

x+o -Cw 



The reader will have no difficulty in verifying that (2), (3) and Lemma 1 continue 
to hold when || ||p is substituted for || ||. Since we have used |x| to denote the 
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size of the vector *, it might have been more tidy to use where F„{x) = |a:|, 

to measure the size of t. We have chosen ||t|| because of its simpler expression 
in terms of the The choice of || || instead of some || ||^ is, for all essential 

purposes, irrelevant, since it follows from Lemma 2, Corollary of Chapter IV that 

where cj and c'^ are numbers depending on the particular function F, but not on t. 

We shall also need later two lemmas relating to distance functions 
and linear transformations. 

Lemma 2. Let F{x) be a distance function such that F{x) =0 only for 
x=o, and let x be a linear transformation. Then there is a number Cj 
depending only on F and x such that 



F(xx) ^ CiF{x) 

for all X. 

For 

F^(x) =F{xx) 

is clearly a distance function. The result now follows at once from 
Lemma 2 Corollary of Chapter IV. If t is non-singular we may apply 
Lemma 2 with instead of t and obtain the 

Corollary. If x is non-singular there is a constant c^ such that 
F(x) ^ c^F(xx) . 



Lemma 3 . Let F(x) be a distance function such that F(x)=0 only for 
x=o. Then to every s in 0<e<l there is an rj =r](e)>0, defending 
only on F and e, such that 



\ 



F(x) 



-f e 



( 13 ) 



for all homogeneous linear transformations t such that^ 



Ijx — i|l <r? (14) 

and all x. 

By Lemma 2 of Chapter IV there exists a number c > 0 such that 

F(x)'^c\x\ ( 15 ) 

for all X. Since F(x) is continuous in the sphere |ae|^2, there exists 
a number in 0 <j^i< 1 such that 



|^(®i) — -P(»2)i <C£, 

whenever 

j 2^1 ^2 1 ^1 » I 1 — 2, j X2 1^2. 



As before, l denotes the identical transformation. 
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In particular, this is true when jtCal =1; and so, by homogeneity, 

-f(®2)l <<^£l*2l. (16) 

whenever 

I ^2 1 ^ ^1 I ^2! ‘ ( 17 ) 

But now, by (5) and (14), 

|xa3 — ®| = |(t — i)£c| ^ «i||T — i|| |a;| <r]-^\x\', 

provided that n^rj<fji', which we may suppose. 

But then from (15) and (I6) with x^ = tx, x^ — x, we have 

(t*) — f (a;)| < eF{x ) , 

which is equivalent to (13). 

V.3. Convergence of lattices. If A is a lattice and t a non-singular 
homogeneous transformation, we saw already in Chapter I, § 3 that the 
set of TO, 06 A is a lattice tA with determinant 

(t A) = I det (t) I if (A) . (1) 

If M is any other lattice, it may be put in the shape 

M =tA 



for some non-singular homogeneous transformation t, and indeed in 
infinitely many ways. For if Oj , . . . , o„ , bj , . . . , 5„ are bases for A and M 
respectively, there is a uniquely defined homogeneous linear trans- 
formation T such that 



and then 



5, = to,- (1^i^«); 
M =tA. 



We say that a sequence of lattices \ (1 gr<oo) tends to the lattice 
A' if there exist homogeneous linear transformations t, such that 

A,=t,A' (2) 

and 

||t,-i||^ 0 (r-^oo), (3) 

where i is the identity transformation and ||t|| is as defined in §2. 
We write then 

A,^A'. 



From (1) and (3) we have immediately 
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If o is any non-singular homogeneous linear transformation it is also 
immediate that 



Indeed 

and 

so 

by (3) of § 2. 



aA,-4-aA'. 
oA, = aT,o“i {“A'} 

— i = o (t, — i) a *, 

aT,a-i-il|^||a||||a-i||||T,-i||-^0, 



Lemma 4- A necessary and sufficient condition that the sequence of 
lattices A, ( 1 ^r<oo) tends to A' is that there exist bases 



and 






by, ... ,b„ 



of A, , A' respectively, such that 

b'-rb'j (i^j^n) (r-^oo). (4) 

The last limit is meant, of course, in the sense of the ordinary con- 
vergence of vectors: |6' — 

The proof of Lemma 4 is almost trivial. Suppose first that A,->-A' 
and let t,. be the transformation satisfying (2) and (3). Choose any 
basis b'j for A' and put 

b' = T,b'j (\^i^n\ 1^r<oo). (5) 

Then by (5) of § 2 and (3), we have 

= IK- ill |b'| -^0 {r^oo). 

Suppose conversely that the bases are given satisfying (4). We may 
define t,, uniquely by (5). Then clearly ||t,— i||->0. 

The following criterion is rather less trivial. 

Theorem I. A necessary and sufficient condition that A,^A' is that 
the following two conditions be both satisfied: 

(i) if a' 6 A', there are points for r — \, 2, ... such that 

a' -^a' {r -^oo). (6) 

(ii) if c is not in A', there is a number r]>0 and an integer fo>0, 
both depending on c, such that 

\aP — c\>r} ( 7 ) 

for all a'eA, with r^r^. 
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It is quite straightforward that (i) and (ii) are satisfied when A,- 
In (i) we have only to put 



or = x,a', 



A'. 



where the t, are the transformations such that 

A, = T, A', 1 1 T, — 1 1 1 ^ 0 . 

Then, as before, 

1 1 o'’ — c' 1 1 g 1 1 T,. — 1 1 1 1 a' I 0 {r ->cx>). 

To prove (ii), we note that there certainly is an rji>0 such that 

|o' — c|>?;i (8) 

for all o' € A'. Put 

(9) 



Then 



Suppose, if possible, that there is a point o'eA, such that 

ja’’ — (iO) 

|«’’| ^ |c| (11) 

By the definition of t, we have 

a' = 'z,a' (12) 

a'~a’ — p,a', (13) 

1 



for some o' (E A. Then 

where 

Now 



I — T, 



\Pr\ 



■ 0 (r -> oo) 



by Lemma 1 and since ||t,— i|| ->-0. Hence by (5) of § 2 and (11), we 
have 

|a'-a'|^«J||p,|||o'|^«J||p,||{|c| +r)}<T] 



for all r greater than some r^. From this and (9) and (10) we have 

\a' — c\^2r] =rji. 

This is in contradiction to (8). Hence statement (ii) of the theorem is true. 

We must now show that if (i) and (ii) of the theorem are true then 
A,^A'. We require a lemma of some independent interest. 

Lemma 5. Let be linearly independent -points of a lattice A 

hut not a basis. Then A contains a point 

d = 
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where are numbers such that 

! ^ ‘ . ( 14 ) 

We first prove Lemma 5 ■ Since Cj , . . . , c„ is not a basis, there cer- 
tainly exist points 

a =aiCi H 

in A for which aj , . . . , a„ are not integers. We may suppose without 
loss of generality that 

Let t be the least non-negative integer such that 

2' max I ay I ^ 4 . 

Then 

2‘ max I a, I ^ ^ , 



will do what is required. A slight refinement of the argument, which 
is left to the reader, shows that the \ in (14) may be replaced by but 
by no larger number. 

We now revert to the proof of Theorem I. Suppose that \ and A' 
satisfy (i) and (ii). Let b[, . . . , b’„ht any basis for A'. By (i) there exist 
sequences of points 

b’-^b'j 5-cA,). (15) 

We show that by is actually a basis for A, except, possibly, 

for a finite number of r. For if 6J, ■ . . , bj, is not a basis for A,, let 



be a point of A with 



d, = &i,b[ + ■■■ +d„,b'„ 
i^maxli?y,l g-i- 



(16) 

(17) 



which is given by Lemma 5- Since the i9y, are bounded, they contain 
a convergent subsequence by a classical theorem of Weierstrass (cf. 
§ 1.2 of Chapter III), say 

lim^y„ = ^', (18) 

/— ^00 

where 



ri<r^< - - <r,< - - 



is an increasing sequence of integers. Then 

d' (say) = 2 by = lim d,^ , 

j ►OO 



Cassels, Geometry of Numbers 



9 
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by (15), (16) and (18). Hence d'cA' by (ii) of the enunciation of the 
theorem. This is a contradiction since 

^ max I I ^ 2 - , 

by (14) and (15), and since 6' (1 ^/^m) was defined to be a basis for A'. 
The contradiction shows that b' is a basis for A, except for a finite 
number of r. If the b' are changed for these exceptional r so that 5 - 
(l^/^f) is a basis for A, for all r this does not affect the limits (15). 
Hence the criterion is certainly sufficient by Lemma 4. 

V.3.2. In Chapter X we shall need the notion of a neighbourhood 
of a lattice, and we shall mention it again in passing briefly in § 9 of 
this chapter. 

A set 2 of lattices A is said to be a neighbourhood of the lattice M 
if it contains all lattices 

A = tM (1) 

with 



for some rj>0 depending on the particular neighbourhood. The neigh- 
bourhood 2 may contain other lattices A than those given by (1) and 
(2) ; but there is some rj>0 such that it contains all these. If a is any 
non-singular homogeneous transformation we show that the set a 2 of 
lattices a A, Ae 2 is a neighbourhood of aM. Indeed o2 contains all 
lattices 

N = o(aM) 

with 

since then 



N =aA 



where 
and then 
as in § 3-1. 



A =«“^ooM: 

|lo-ioo-i||^||a-i||||a||||o-il|<r? 



Clearly the sequence A, (1gj'<oo) of lattices tends to M if and 
only if every neighbourhood of M contains all but a finite number of 
the A,. 

Although we nowhere use it, we note that it is in fact possible to 
introduce explicitly a metric into the space of all lattices. Let A and M 
be two lattices and let 
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where the infima are over all non-singular o and t such that 



A = o M M = tA. 



Put 

D(M, A) = D(A, M) = max{log(1 + n), log(t -t-v)}. 
Then we have the triangle inequality 

D(A, N)^D(A,M) -f D(M,N); 

since if 

A = (I + Pi) M , M = (i -h P2) N ; 

then 

A = (i + Pa) N . 

where 

IIP3II =||pi + p2 + pip2l|^||pil| +IIP2II +||pilll|pa| 

and so 

logO +||P3ll)^log(l +IIP 1 II) +log(t -fllpall). 



The neighbourhood defined above is the one associated with this 
metric, since if 

A = oM 

with 
then 
where 



and so 



J - i||< rj<\-, 
M =o-iA, 



ill < 



0-1 



O- M 



< 



1 -Tj 



D(A, 

1 



V.3.3. The continuity of the distance-function F(x) of the vector x 
is reflected as a semi-continuity of the function 



F(A) = mf F{a) (1) 

oe A 

+ 0 

of the lattice A considered in § 4 of Chapter IV. For certain later 
applications it is useful to allow the distance-function F and the lattice A 
to vary simultaneously. 

Theorem II. Let A, (l^y<oo) be a sequence of lattices tending to 
the lattice A'. Let F,{x) (l^r<oo) be a sequence of distance functions 
which converge uniformly to the distance-function F'(x) in the unit sphere 
|x|<1. Then 

F'(A') ^ lim sup i; (A,) . (2) 

f— ►CO 



9 ’ 
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The proof is very simple. Since F^{tx) =tF^[x) for f>0, the con- 
vergence of F,{x) to F'[x) is uniform in any bounded set of points; 
in particular, since the distance-function F'[x) is continuous by defini- 
tion, if a, is any sequence of points converging to a point a', we have 

lim i;(a,) =F’{a'). 

f— ► OO 

But by Theorem I, every point a'=|=o of A' is the limit of points 
of A,. Hence 

F'(a') = lim F,{a^) ^ lim sup JF; ( A,) , 

f— *-co r— >00 



since F^{a^)'^F,(A^). The result (2) now follows from the definition (1). 

The sign of equality need not hold in (2) even when F,=F' for all r, 
but we defer giving an example until § 10.5- However, much more 
than Theorem II is true if F'[x) = 0 only for x = o, i.e. if the set F'(x) < 1 
is bounded (Lemma 2 of Chapter IV). 

Corollary. Suppose that the hypotheses of Theorem II hold and that 
the only point x such that F'(x) =0 is x = o. Then 

lim T;(A,) 

f— >oo 

exists and is equal to F'(A'). 

The proof is similar to that of Lemma 3- By Lemma 2 of Chapter IV, 
there is a OO such that 

(3) 

for all X. Let e>0 be arbitrarily small. By the uniformity of the con- 
vergence of F,{x), there is an r# such that 

|F,(®)-F(a!)|<c£ (4) 



for all ® "'ith |x| =i. Hence for all x whatsoever and 

have 

I F, («) - F(£c)| < c £ I a; I g £F(a:) ; 



so 



I — £ < 



fi(x) 

F'(x) 



< I + £. 



( 5 ) 



Now let A,=t,A', where are homogeneous linear transformations 
such that 



||t,-i||-^ 0 (r->oo) 



in the language of § 2. Then 



1 — £ < 



F'(x) 



<1 -he 



( 6 ) 
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for all r greater than some r^, by Lemma 3- Hence by (6) and (5) with 
T,a5 for X we have 

for all r>max(rj, fj). But now A, is just the set of t,x with x^N, 
and so^ 

Since e is arbitrarily small, this proves the corollary. 



V.3.4. An almost immediate consequence of Theorem II, Corollary 
is the following result, which shows that no bounded star body can 
have successive minima in the sense of Chapter II, § 4. 

Lemma 6. Let F[x) be an n-dimensional distance function which 
vanishes only when x = o and let rj be any number for which 

0<??<(5(F) =supi^i^. (1) 

Then there exists a lattice M,, such that 

{F{M^)Y = r,d{M^). 

After Theorem VI we shall be able to replace the second < in (t) 
by 

Suppose that rj satisfies (t). Then there exists a lattice N, such that 
{F{H)Y>rid{H). (2) 



Let bj , . . . , 6„ be any basis for N ; and for 0 < e < t let be the lattice 
with basis 



Then 

and 

Hence 



ebi, &2, ..., 

i(NJ =£rf(N) 



F{Nf}^F(eb,)=eF{b,). 



d(N,) - d(N) 



(£-^ 0 ). 



( 3 ) 



But now, by Theorem II Corollary, F(NJ is a continuous function of e. 
Hence by (2) and (3) we may put M,, = for an appropriate value of s. 

V.3.5. For the sake of completeness we enunciate the following 
lemma, which interprets the uniformity of the convergence of F,(x) to 



* Note that F'(A') =t= 0 by Lemma 7 of Chapter IV. 
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F'{x) in terms of the corresponding star bodies 

K- Fr{x)<i ( 1 ) 

and 

F'(x)<i. (2) 

Since we do not use the lemma we do not give the proof, but the reader 
should have no difficulty in constructing one along the lines of the 
proof of Theorem I of Chapter IV. 

Lemma 7- A necessary and sufficient condition that the sequence of 
distance functions F,{x) tend to the distance-function F'(x) uniformly in 
I ac I ^ i is that the bodies and 6^' defined by {\) and (2) have the following 
properties: 

(i) If c is an (inner) point of If' , then there exists an rj>0 and an 
integer r^ (depending on c) such that all points x of the neighbourhood 
\x — c\<r] belong to ^ for all r greater than r^^. 

(ii) If c is an exterior point to Sf' {i.e. F'{x)>i) then there is an 
Tj>0 and fj such that no point x of the neighbourhood |a; — c| belongs 
to for any r>r^. 

V.4. Compactness for lattices. In this section we are concerned 
with conditions under which an infinite sequence A, of lattices should 
contain a subsequence M, = A,, which converges to a lattice M', not 
necessarily belonging to the sequence. 

The simplest such condition is when every lattice of the sequence 
has a basis every point c/f which hes in some fixed sphere 

( 1 ) 

and d{f\() is bounded below by a positive constant, say 

d (A,) ^ PC > 0 (all r) . (2) 

Since all the lattices have bases in (I) we may by Weierstrass’ com- 
pactness theorem, find a subsequence of lattices M( = A^^ with bases 
b{, in (t ) such that all the limits 

lim bj = 5.' 

exist. By (2) we have 

|det(6{, ..., 6')| = lim |det(6l, ..., b(,)| = lim (MJ ^ x> 0: 

(->00 (->oo 

and so b'l, ...,b(^ are linearly independent. Hence there exists a lattice 
M' with basis b'l, ...,b'„ and, by Lemma 4, 

(t->oo). 
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A slight extension of this idea gives the following theorem which 
however turns out not to be very useful. We give it partly for historical 
interest and partly because the lemma on which it depends will be used 
later. 

Theorem III. Let A, (1 ^r<o6) be an infinite sequence of n-dimen- 
sional lattices enjoying the following two properties: 

(i) there exists an R such that every A, has n linearly independent 

points in the sphere , , ^ ^ 

(ii) there exists a x>0 such that 



d{\)^x 



for all r. 

Then A, contains a subsequence of lattices M, for which 



M' = lim M, 

/— >00 

exists. 

The proof of Theorem III depends on the following lemma due to 
Mahler (1938a) and rediscovered by Weyl (1942a). 

Lemma 8. Let F{x) be any symmetric convex distance function and 
be n linearly independent points of a lattice A. Then there 
exists a basis 6j , . . . , 5„ of A for which 

F{bf} ^ max[F(o^), -J-{F(Oi) 4 h-F(a,)}]. 

Before proving Lemma 8 we show that Theorem III follows from it 
by applying it to the convex function F(a;) =|®| and to the n linearly 
independent points a{, of A, given by (i) of the theorem. Then 

Lemma 8 shows that A, has a basis b\, with 

1 5)1 g max [|o)|,-mo)l 4 \-\a'\]]^nRj2. 

We have thus reduced Theorem III to the trivial case discussed at the 
beginning but with nRj2 instead of R. 

It remains to prove Lemma 8. By Theorem I of Chapter I there is 
a basis Cj , . . . , c„ of A such that 



« 



— 1^21^1 4" 1^22®2> 



( 3 ) 



where the Vij are integers and v^,=4=0. We shall take 5,- of the shape 

bj = Cj + tj^ ay_i + • • • + j O], e A , (4) 
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where the are numbers to be determined. Clearly 6j, is a 

basis for A for any set of numbers i,, such that 6y€A. 

We distinguish two cases for each j. If v,;j= ±1, we put by= ± Oy. 
This certainly has the shape ( 4 ) and 



F{hj) = f(ay) . 

Otherwise \vu\^ 2 . On solving (3) for the Cy we have 

Cy = + ^y,,-ia,_i + • ■ • + AlyiOj , (5) 

where Ay, are certain real numbers. Choose fy, in ( 4 ) to be integers such 
that 



Then by 6 A and 

where 

and 



1 ^;.' + ^'n\ ^ i- 

I/.-,! 



( 6 ) 



\h\=\^ii + hi\^i (*<?)• 

Then by the convexity symmetry and homogeneity of F(x) we have 



f(by)^T-(/yyay)+...+f(/yiai) 

= |/yy|f(ay)+---+|/;l|F( 0 ,) ■ 

^i{F(a,) + ...+f(Oi)}. . 



( 7 ) 



This concludes the proof of the lemma. 

When F{x) is the usual euclidean distance, an argument due to 
Remak (1938a) gives a sharper result. See also van der Waerden 
(1956a). 

When 

F{a^)^F[a^)^---^F{a„), ( 8 ) 

Lemma 8 gives 

F(by)^max(l, ”)F(Oy). 



V.4.2. We owe to Mahler (1946d, e) a criterion for the existence 
of a convergent subsequence of lattices in a sequence of lattices, which 
is much more fertile of applications than Theorem III, and which may 
be said to have completely transformed the subject. Mahler proved 
his criterion by using the theory of successive minima^ of a sphere to 
show that it is equivalent to that of Theorem III. We shall give 



* Not to be confused with the “successive minima” discussed in Chapter II 
which are quite different. 
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Mahler’s argument^ when we discuss successive minima in Chapter VIII, 
but here we give a direct treatment due to Chabauty (1950a), who 
shows that it generalizes significantly to a more general situation (sub- 
groups of locally compact topological groups). Mahler’s criterion is 
expressed in 

Theorem IV. Let A, he any infinite sequence of lattices satisfying the 
following two conditions 

(i) d(\)^K for all lattices A,., where K is independent of r. 

(ii) |A,,|^f<>0 for all r where x is independent of r and, as usual. 

I A| = inf |a| . 

' ' aeA' ' 

4=0 

Then A, contains a subsequence M,, = A,, which converges to a limit M'. 

We prove^ Theorem IV by induction. The result of the /-th stage 
(1^/^w) will be the following statement: 

©y ; There exist f linearly independent points Oj , . . . , Oy and a sub- 
sequence N< = N| (1 ^ i < oo) of A, which satisfies the following conditions : 

©': Each point a, (l^f^/) is the limit of points 

a'6N,-N' (1) 

Suppose that L<t,^<--- is any increasing sequence of integers 
and there exist points such that 

limc,=yiaiH hy/O,' (2) 

S — >-00 

with real yi , - . . , yy . Then yj , . . . , yy must be integers. 

Before continuing the proof we note that the statement imphes 
that the lattices M( = N" converge to the lattice M' with basis Oj , 
the parts and ©(,' of corresponding respectively to (i) and (ii) of the 
Theorem I . Hence it suffices to prove ©„. 

We do not give a separate proof of ©j since that is a simple version 
of the deduction of ©yy.j from ©y. For the rest of this section we shall 
assume therefore that ©y holds for some / in 1^/<« and will deduce 
©y+i. The sequence will be a subsequence of the sequence N< = N) , 
and the points Oj, ..., ay will be the same in ©y and ©y+j. 

A non-singular homogeneous hnear transformation of the variables 
does not affect either statement ©y or the hypotheses of the theorem, 
though it will in general replace the K and x in (i) and (ii) by different 
numbers. Hence we may suppose without loss of generality that 

a, = e, = (or^, 1,0~^) (i^i^j). (3) 

' The reader may prefer, instead of studying the proof here, to turn to §§1,2 
of Chapter VIII, which are independent of the intervening matter. 

* I now prefer the proof given by Chabauty (1950a) to the version given here. 
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Define the number y> by 

(!)>"-' = ^. (4) 

where K is the number occurring in hypothesis (i) of the theorem. By 
Theorem III of Chapter III, each lattice N, contains a point x^=o with^ 

\Xi\£f a + i£i^n).] 

Let c‘ be one of the finite number of points of other than o in (5) 
for which 

max I X, I (6) 



is a minimum. Since the c‘ are in the bounded set (5), they contain a 
convergent subsequence, say 

(7) 

where 



f 1 f 2 ‘ • 

Write 

O'j+i = {Ai, . . . , A„) , (8) 

so that clearly 

M,|^f 1 



Suppose first, if possible, that =^„ = 0, so that ay_,_i is linearly 

dependent on Oj , ... ,a^. We are assuming statement ©y to be already 
established. Hence by (7) we could apply ©" with yi=A^ 
and it would follow from (9) that Ai=-- - =A, = 0, and so 



lim c‘‘ = o. 

s->oo 

This contradicts hypothesis (ii) of the theorem, since c'*£N(, = A, for 
some r and c*'=bo. Hence the vectors ai,...,ay+i are hnearly inde- 
pendent. We put f^=Nj,, and will show that the statement ©y+j 
now holds for Nj+^ = FJ. 

The statement ©y^^j is trivially true. So far as ay_n is concerned, 
©'+j follows from (7); and so far as the remaining Oj are con- 

cerned, @'+i follows from ©' since is a subsequence of N,. 

It remains to prove ®,>i. Suppose, if possible, that there is an 
increasing sequence of integers 

Si<S2<---<s„<--- (10) 

and vectors 

*tn 



* The only property of |- we use is J < f < 1 • 
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such that 



lim d'"‘ = d (say) 

/^OO 

= <5iOi + ■ ■ • + ^,+ia/+i 



( 11 ) 



where (5i , . . . , are not all integers. By ©y+i > which we have already 
proved, we may add integer multiples of Oj, to the right-hand 
side of (11), after appropriate modification of the sequence d'”. Hence 
we may suppose in the first place that 



I ^f+ll = 2 



( 12 ) 



and in the second place, by (3), that 






(1^1^/), 



(13) 



where, as usual, d = {di, ...,i„). From (8) and (12) we have 



max max \AA 

; + lSiSn' ' ' j' + lSiSn ' 

<i max \A:\ 

< rp. 



(14) 

(15) 



We now show that this in contradiction with the definition of the vectors 
c' as the vectors a? of N, in (5) other than o for which (6) is as small 
as possible. Since we have 

lim max Ic.-J = max \AA, (16) 

>■—►00 

where 

c* = {Cii, .... C^i) . 



By (13) and (15) the vector d*” certainly lies in the region defined by 
(5) when m is large enough. Further, d^^cNj-, where T But 

now, by (14) and (I6), the function (6) is certainly greater for than 
it is for d*" when m is large enough, which contradicts the choice of c^. 
The contradiction shows that if (11) holds then ..., are all 
integers; that is the statement holds. 

This ends the deduction of ©,41 from ©^ , and so concludes the proof 
of Theorem IV. 

We note a form which is often useful in applications and which does 
not depend on the use of the special distance-function |aj|. 

Corollary. Let F(x) he any distance junction and let A, he any 
infinite sequence of lattices satisfying the two conditions 

(i) d(Af)^K for all r, where K is independent of r. 

(ii) F(A,) for all r, where x is independent of r and, as usual, 

F(A) = inf F(a). 

og A 
+ o 

Then A, contains a convergent subsequence. 
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For by Lemma 1 of Chapter IV there is a C > 0 such that F(x) g C | x | , 
and so 

V.4.3. An almost immediate consequence (cf. Mahler 1949a) of 
Theorem IV is 

Theorem V. Let be any open set. Let 5^, 5^^, .5'^, ... be a 
sequence of open subsets of Lf such that 
(i) Sf, is contained in ^ if r<t, 

( ii ) the origin is an inner point of 6^^ , 

( Hi ) every point x of 6^ is in Lf, for some r. 

Then 

A(6T) = lim 

r^oo 

We recall that A{.^) is the lower bound of d{A) over 5^-admissible 
lattices A, i.e. A having only o in Clearly 



for all r. Suppose that 



liminfA{y,)<A(Sr). 

r~*oo 



( 1 ) 

( 2 ) 



Then there is an increasing sequence of integers r^<r 2 < - and 
lattices A,^ such that 

hmd{\)<A{£T)-, 



and A,, is i'’, -admissible. By (ii) and Theorem IV we may extract a 
convergent sequence of lattices from the A,,, so that without loss of 
generality 



limA,=A'; d{A')<A{y). 

t ~*-00 ‘ 



Hence there is a point p 4= o of A' in S^. By (iii) then p is in .5^ for 
some R. By (i) and since .9^ is open by hypothesis, there is a neigh- 
bourhood 






( 3 ) 



every point of which is in for all r^R. 



But now 



p = lim p’, p' e A, 

r— »• 00 



by Theorem I. Hence x=p' satisfies (3) for all r greater than some r^. 
For r> max {R, r^ this means that p^ is in £T contrary to our assumption. 
The contradiction arose from the assumption that (2) is true. Hence 
the theorem is true by (1). 
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When the ,9^ and ff, are star-bodies, Theorem V follows fairly 
immediately! from Theorem II but we shall in fact apply Theorem V 
when is not a star-body in Chapter VIII. The proof of Theorem V 
gives also the following corollary which is a trivial consequence of the 
theorem when the ,9’ and ,9^ are star-bodies, but which is valid when 
they are not. 

Corollary. Suppose that for l^r<c» there is an 9^-admissible 
lattice f\, with 

d[\) =4, 

for some number . Then there is an 9-admissible lattice A with d (A) . 

V.5. Critical lattices. Let F{x) be a distance-function. It may 
well be that F(A) =0 for every lattice A, in which case we say, following 
Mahler, that the distance function and its associated star-body are of 
“infinite type’’. An example of a distance function of infinite type in 
two dimensions is 

F{x) = ;t 2 p- 

Any lattice A of determinant d(A) =d contains a point o = (aj, a^) =t=o 
with 

|ai| ^ l^al ^ djs, 

where e>0 is arbitrarily small, by Minkowski’s convex body Theo- 
rem II of Chapter III. Then 

F{a) ^\ed\^ 

is arbitrarily small, so F(A)=0. It is not always possible to decide 
whether a distance function is of finite type or not, for example, this 
is not known in the case of the 5 -dimensional distance functions 

F{x) = \x\ + xl A- xl A- xl - 

and 

F(x) = \ xl A- xl A- xl — xl ~ xl\K 

The problem whether these functions are of infinite type or not is equi- 
valent to the problem whether aU indefinite quadratic forms in 5 vari- 
ables represent arbitrarily small values (including 0) or not for integer 
values of the variables (cf. § 3 of Chapter I). A classical theorem of 
Meyer says that if the coefficients of the form are rational then it 
represents 0. Recently Davenport and more recently B. J. Birch have 
developed an attack on this problem but it appears to work only for 

! When the 9 and 4 star-bodies, say, with distance-functions F[x) and 
F, (ac), the hypotheses of Theorem V imply that, for each x, F^{x) tends monotonely 
to F(x). Since F^(x) and F(x) are continuous, this convergence must be uniform; 
and so Theorem II applies. 
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indefinite forms in more variables than 5 [see Davenport (1956a) and 
later work of Davenport and Birch]. The results of Chapters VI and X 
sometimes permit one to decide whether a given distance function F{x) is 
of finite type or not but beyond that very little is known. For another 
unsolved problem of this type with important implications see Cassels 
and Swinnerton-Dyer (1950a). 

Most of the investigations in the geometry of numbers are concerned 
with distance-functions F of finite type, i.e. not of infinite type. Then 



d{F) = sup 
A 



{F(A)}" 

d(\) 



( 1 ) 



s strictly positive. Then by Theorem VI of Chapter IV, 



0 < (5 (F) < oo (2) 

and 

d{F)A[£^)=\, (3) 



where A{^) is the lattice constant of the set 

6F-. F(x)<\. (4) 

We recollect that a critical lattice for ,5^ is a lattice A which is 
^’-admissible and which has determinant d[/\) =A(y’) (Chapter III, §5). 
A general theorem of Mahler states that a set y of the type (4) always 
has critical lattices if it has admissible lattices. 

Theorem VI. Let the distance-function F[x) he of finite type. Then 
there exist lattices A such that 

F(A)=1, ^(A)={<5(F)}-i=zI(^), 

where 6[F) is defined in (1) and A{y) is the lattice constant of the region 
defined by (4). 

The proof of Theorem VI is now quite simple. By the definition of 
A{y), there exists a sequence of lattices A, such that 

F(A,)^1, d{\)-^A[y). (5) 

We may now apply Theorem IV Corollary 1 , its conditions (i) and (ii) 
being satisfied by (5). Hence there exists a convergent subsequence, 
and so, after a change of notation, we may suppose that 

for some lattice A'. By (5) we have 

d[^') = lim d{\) =A{y). 




Critical lattices 



143 



By (5) and Theorem II, we have 

F(A') ^ lim sup /‘'(A,) 1 , 

r-vcc 

If /‘(A')>1 there would exist a real number (?<1 such that 

A(^A')^1, d{d A') =rd{A') <A{y): 

in contradiction to the definition of as a lower bound. Hence 

F(A')=\. This concludes the proof of the theorem. 

In evaluating for star-bodies £A we may therefore confine 

attention to critical lattices. 

There is an alternative formulation of Theorem VI which does not 
need to distinguish between the two cases 6[F) =0 and ^(A)>0: 

Corollary. For every distance-function F{x) in n-dimensional space 
there is a lattice M such that 

d[M) =1 

and 

{A(M)r = ^(A)=supi^<^»’’-, 

For if d(F)=0, any lattice M with will do. Otherwise 

M =§A' will do, where A' is a critical lattice and {f is chosen so that 
c^(M) =1. 

V.5.2. It would be natural to assume that every critical lattice A 
for a star-body 

y-. F[x)<'[ 

should contain a point a with F{a) = 1 , but in fact this is not the case 
even in 2 dimensions. Here we construct a counter-example using the 
phenomenon of successive minima discussed in § 4 of Chapter II. Write 

F^[x) =\x^x^\i. (1) 

Theorem IV of Chapter I when translated into our present language 
implies that 

{F„(A)}^gi(A)/8-^ (2) 



except when A is a lattice A^ with basis 

^ 2 l ) 1 ® 2 “ (^ 12'^2 2 ) 



such that 



(wj a ^^ “b 1^2 ^1 2) (^1 ^2 1 “b 1^2 ^2 2) — ^ (^1 F ^2 ^2) 



( 3 ) 

( 4 ) 



identically in «i,M 2 some number A; in which case 



{Fo(Ajp = ^(AJ/5V 



( 5 ) 
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In particular 






Now consider the distance-function 



( 6 ) 



F^{x) =Fo(oe) 



so that 









(7) 

( 8 ) 



From (8) and (2) or (5) we have 



if A is not a A^; and 
respectively. Since 



{F,(A)P^(-^)^i(A)/8» 

{F,(Aj}^^i(A,)/5i 



(9) 

( 10 ) 



a critical lattice for (ac) < 1 is necessarily a A^ . 

We show now that equality holds in (10). After a possible inter- 
change of Xi and we may suppose that 

^1 ^1 1 “h ^2 2 “ ^11 (^1 “ f “ ^ ^ 2 ) 

^1 ^2 1 d ” ^ 2^2 2 ~ ^21 (^1 d “ W ^ 2 ) » 

where 

2(o=i+SK 2y)=1 — 5^, ^=«ii'*2i. 

on factorising the right-hand side of (4). Here 



Since 



coy) = — 1 . 



we have 

o)‘ = -f «2 • o> ; -f f ; 

for every positive integer i and certain integers m!.'* 
1 / (say) = (aiiCo‘,aiiy/)eA,. 



Hence 



But now, since <oy> = — i, we have 



^i(y‘) — I ^ii^2i|^ 

= kK 



1 + 



100{laji|co'-|- . 



I ^11 <*21 



Hence 

{F,{\)Y^k=d{W- 
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This with (10) gives 

But now if a 4= o is a point of , we have trivially 

{F,ia)Y>{F,{a)Y^d{A,)lsi. 

In particular, if k = \, so that d(A^) = =A{£^i), where is the 

region (x) < 1 , there are no points a of A^ on the boundary (x) = 1 
of A,. 

By an ingenious argument, again using the phenomenon of successive 
minima, Rogers (1947c) has constructed a distance-function F(x) such 
that the critical lattice of the unbounded star-body -F(x)<l has only 
one pair of points +a with F(±a) =1. All other points b^o of A 
satisfy F{b)^t for some explicitly given t>\. This is in striking 
contrast with the results we shall prove in §6 about bounded star-bodies. 

V.6. Bounded star -bodies. For bounded star-bodies a great deal is 
known about critical lattices. [See in particular Mahler (d, e) and for 
an extremely detailed treatment of the 2-dimensional case Mahler 
(a, b, c).] In contrast to the negative result of § 5-2 we now have 

Theorem VII. Every critical lattice A of a hounded star body if has 
n linearly independent points on the boundary of . 

For suppose not. Then there exists a basis bj, ...,&„ of A such that 
any point 

p ••• +m,. 5„ («!,...,«„, integers) (1) 

of A on the boundary of has u„ — 0. Since iF is bounded, there exists 
a number Y such that if a point 



yibjH hy„b„ 

with real y\, ■■■, y„ is in or on the boundary of then certainly 
|y,| ^ y (1 g t g «) . 

Now let £ be a number in, say, 

0 <£< 2 -, 

and let A^ be the lattice with basis 

bi,...,b„_i and (1-e)b„. 

Consider a point 

p, = Ml bi H h M„-i b„_i M„ (1 - e) b„ 



Cassels, Geometry of Numbers 



10 



( 2 ) 
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of Aj, where are integers. If m„= 0, then is in A; and so 

is either on the boundary of or outside If 

max I M, I > 2 y , 

lS;Sn ' " 

then certainly is outside y. We need therefore consider only the 
points with 

max|My|^2Y, w„=j=0. (3) 

But now for these m, the corresponding point p given by (1) is an 
exterior point of y, since m„=|= 0; that is some whole neighbourhood 
of p lies outside y. Hence p^ cannot be in y for all s smaller than 
some £(,, which may depend in the first place on Mj, But there 

are only a finite number of to consider, by (3), and hence 

A, is y-admissible if s is small enough. But now 

d(A,)={\-e)d{A)=(i-s)A(y), 

since A was assumed to be critical. But this contradicts the definition 
of A(y) as the lower bound of the determinants of admissible lattices. 

It is only exceptionally that there can be as few as n pairs of linearly 
independent points ±ay (1 ^j^n) of a critical lattice on the boundary 
of y. Rather surprisingly, it is possible, however, at least when n = 2, 
for a star-body to have a continuous infinity of critical lattices each 
with only n pairs of points on the boundary, see Ollerenshaw (1945 a). 

Corollary. Suppose that (1 n) are the only points of A 
on the boundary of y . Then there exists an Sg such that all points 



with 



®n + £i®i + • ■ • + 



1 



max 



cyl eg 



(4) 

(5) 



are either in or on the boundary of y . 

For Cj , . . . , a„ are linearly independent by the theorem ; and so 
there exists a basis bj , . . . , 6„ such that 



Oy = bi H -1- (1 g t ^ n) (6) 

with integers v^j and v,;=f=0. Let A^ be the lattice with basis 

where 

+ >?l^l + • ■ • + »?»-lb„_i, (7) 



and rji, are small real numbers. As in the proof of the theo- 

rem, if max|?yy| is small enough, the only points of A^ which can lie 
in or on the boundary of y are rh«i, ±®n-i (which are unchanged 
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by the substitution of 6^ for 5„) and ±®n. where 

a'l (say) == v„ 1 6 , -f • ■ ■ + v ,,^ - 1 _ i + v„ „ 6'' . (8) 

But 

d (A,) = I det (f>i , . . . , f>,, , . 6;!) 1 = I det (bi , . . . , ft,.) | = ( A) = Zl (^) . 

Hence either a’l is in .9^, when there is nothing more to prove, or AJ is 
critical, and then a’l is on the boundary of ^ by the theorem. Since 
every vector of the shape (4) can be put in the shape (8), where max|r 7 , | 
is small if max|£,| is small, this proves the corollary. 

V.6.2. For the continued study of the points of a critical lattice 
on the boundary of a bounded star-body, we need an estimate of 
det (Oj , . . . , aj in terms of 

|«,1 



where Oj , . . . , a„ are any «-dimensional vectors. For our present purposes 
any estimate, however crude, would suffice, but, since we shall later 
need a more precise estimate, we prove it here. 

Lemma 9 (Hadamard). Le( a^, a„ be n-dimensional vectors. Then 
ldet(ai,...,a„)lg|ai|...|a„|. 

We note that the simple example 

aj = e,. = (o, . . . , 0, t, 

shows that ^ cannot in general be improved to <. The inequality is 
the M-dimensional analogue of the fact that the volume of a parallelo- 
piped is at most the product of the length of the sides. 

If the determinant is 0 there is nothing to prove. Hence we may 
suppose that , . . . , a„ are hnearly independent. We construct a 
sequence of vectors Cy such that 





CyCy = 0 {i^j) 


(1) 


(scalar product of two vectors), and 




Oy = 




(2) 


for some real numbers 


f,y. Indeed if c^^a^ and the Cy 


are defined 


recursively by ^ 


: = ay-2(a.Cy)lcy|-2cy, 





j<‘ 



it is readily verified that the Cy have the required properties. By (1) 
and (2) we have 






+ ••• +<M-l|Cy_il^-b|Cyl2^| 



det(ai,...,o„) =det(ci,...,c„). 



10 ’ 



(4) 
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On the other hand, on regarding the , . . . , c„ in det (Cj , . . . , c„) first as 
rows and then as columns and multiplying the two determinants to- 
gether, we have^ 

{det (Ci , . . . , c„)p = det {c.c,} = /7 1 c.P, (5) 

by (1). The required inequality now follows from (3), (4) and (5). 

V.6.3. We may now show that, in principle, the evaluation of Zl(y’) 
for a bounded w-dimensional star-body ^ may be reduced to a finite 
set of ordinary minimal problems. Except for convex bodies, for which 
see § 7, this is hardly in practice a fruitful approach, though it might 
well be adaptable to machine computation. 

We may suppose without loss of generality that S/’ is defined by 

Se-. E(a;)<l, (t) 

where F(®) is a distance-function. By Lemmas 1 and 2 of Chapter IV, 
there are numbers OO and C such that 

c|a;| ^F(a;) g C|a;| . (2) 

In particular, a lattice A admissible for has no points in the sphere 

|a;| < C'l, 

and so has 

i(A)^2-"C-”F„, (3) 



by Minkowski’s convex body Theorem II of Chapter III, where is 

the volume of the unit sphere | a; | < 1 . 

Now let A be a critical lattice, so that there are (at least) n linearly 
independent points eq, ...,a„ of A on the boundary F(x)=\ of y. 
Then by (2) we have 

loyl^c-i (4) 

and so by Hadamard’s Lemma 9 we have 

|det(ai,...,a„)|^c-». (5) 



Hence in the language of Chapter I the index / of , . . . , a„ in A is 



j _ |det(gj g„)| 

d(A) 




( 6 ) 



Hence by the corollaries to Theorem I of Chapter I, there is a basis 
6i , . . . , b„ of A such that 

= \-Viibi, (7) 



* Alternatively one may observe that, by (1), 2 ■*. e; * = 2 I and compare 
determinants. * 
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where the v,, are integers, 

0^v^,<Vii ( 8 ) 

and 

0< n Vii = I^Io- (9) 

There are thus only a finite set of possibility for the integers fj,. For 
each set of integers the points on the boundary determine the 
by (6). The a, are to be chosen so as to make 

dIA) — ■■■.«»)| 

a minimum, subject to no points of A being in and, in particular, 
subject to (3). Then A{6^) is clearly the minimum of d{A) over the A 
so obtained and over all of the finite number of choices for the v^j. 
We now verify that if A is a lattice constructed with n points 
on the boundary and satisfying (3), (5), (6), (7), (8), (9), 
and if d were any point of A in then d has the shape 

where bounds can be given for the integers Uj. Indeed then 
and so for each integer j we have 

|det(Oi, ...,a,_i,d,a,+i, ...,a„)| 

by (4) and Hadamard’s Lemma 9- Hence, if (3) is true, the index of 
Cj , . . . , ay_i , rf, Oy+i , . . . , a„ in A is at most 7 q for / = 1 , 2, . . . , » : and 
it is easily verified that this gives bounds for the It is thus, in 
principle, a finite problem to find A{£^). 

The lattice constants of a great many 2-dimensional bounded star- 
bodies have been evaluated. There is a partial list in Keller (1954a) 
to which may be added among others the bodies discussed by Olle- 
RENSHAW (1945a, b, 1953g)- The treatment of bounded non-convex 
body in more than 2 dimensions by such methods seems inevitably 
laborious. Perhaps the only cases worked out are those of N. Mullineux 
(1951a). 

V.6.4. In the evaluation of A[£^) for a given star set 6^ it is usually 
best to combine the techniques just introduced with those discussed 
in Chapter III. We consider an instructive example due to N. Mullineux 
which we shall have occasion to discuss further in § 7. 

Lemma 10. Let k be an arbitrary positive number and put 

D = (yfea + 4/^)1, 



g — i (^ + 2 -|- D) , 



and 
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SO that 

g-^ = \{k + 2-D). 

Let y he the 2-dimensional star-body defined by 

— i<XiX2<k, \x^-\-Xo\<D. 

Then 

A[9’) =Z). 

The only critical lattices have bases of one of the two following kinds 
(i) the point (1, —1) and any point on x^-\- x^ = D, 

(ii) the points 

p = (-g-i^,gri), q = {-t,t-i) 



where t is any number in the range 

We must first verify that the lattices defined above are ^^-admissible. 
This is certainly true for (i). We now verify it for (ii). It is readily 
verified that the hne Xi-\-x^ = D meets x^X 2 = — \ in the points 






Hence the points p and q above do lie on the portion of the boundary 
of given by = — 1. The point 



r =p-q={\{-k -[-D)t, \{k+D) fi} = {r^.r^) 

lies on 



Further, 

since \<t<g and 



r^r^ = k. 
0<r^-\-r^<D , 
L^^k+D)t+\{k+D)t-^ 



equals D both for t = \ and for t=g. Hence a lattice of type (ii) has six 
points ±p, ± 9 . ±*' on the boundary of y. There can be no further 
point of the lattice in since it is easy to verify that every point 
of y except ±r lies either strictly between the (infinite) hne A through 
p and r and its image — A in o ; or strictly between the hne p through 
q, r and its image — p in o; for example the line A meets XjX 2 = — 1 
and x-yX^—k respectively apart from p, r in the points 

[gt, — {l(k + D)t, 1 (— ^ + 

and for both of these \xy-\-x^\>D. 
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For later use we note that the whole of the line-segment joining 
p, r must lie in ^ except the end points, since a line can meet a hyper- 
bola — 1 or = k in at most two points. Hence the whole 

of the closed parallelogram with vertices at o, p, r and — q must lie 
in ,9’ except for p, r and — q. 

We are now in a position to prove Lemma 10. Let M be a critical 
lattice. Suppose, if possible, that there is no point of M on the portion 

f \Xi + X 2 \<D 

of the boundary of Then the set of points^ 

M,: — s) Xi + 8X2, exi+ (\ — e) x^}, [x^,x^^lA, 

for small enough e, will also be ^’-admissible since 

{(1 - e) -b £ (fi -f (1 - e) vj = x^ + X 2 

and 

{(1 — e)xi + e X2){e ^1 + (1 — e) X2} = X1X2 + {e — e^) {x^ — X2]^ ^ %^ 2 - 
Since 

= H - 2 e) d{M) < A{ 9 ^) , 

this contradicts the hypothesis that M is critical. Hence there is a 
point q = (qi, on the boundary x 2 X 2 ~ — \ of and, by symmetry, 
we may suppose that 

-9i^1^92>0. 

Suppose first that q=j= ( — 1, 1). Then 

for some t m \<t<g. Let us identify this q, with the q of the lattice 
A introduced earlier, and let p, r have the meanings introduced then. 
Since A is admissible and M is critical, we have 

The line A of points x with 

det {x, q) =d (A) 

passes through p and r, so the line 

det(a;, g) = (M) (1) 

must either coincide with it or lie between it and the line through o 
and q. But now q is a primitive point of M, since r~^q if for any 

* This argument becomes more transparent on introducing temporarily the 
co-ordinates Vj = J- f {x^ — x.^) ■ 
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integer r> i ; and so there are points of M on the line (1) and at a distance 
|q| apart. Hence there must be a point of M other than o and —q 
in the closed parallelogram with vertices at o, —q,p and r. But we 
have already seen that the only points of this parallelogram which are 
not in 6^ are the vertices p, r and — q. Hence either p or r is in M ; 
and in both cases then M coincides with A. 

There remains the possibility that q = (— 1, 1). If the definition of 
p and r is extended in the obvious way to t — \, the situation remains 
the same, except that now the whole line-segment joining p and r is 
part of the boundary x^-\-X 2 =D of 6^. Hence we may deduce only 
that M has a basis consisting of (—1, 1) and some point on -|-X 2 = Z). 

For this type of proof compare Ollerenshavv (1945b). 

For later use we note that we have also proved the 

Corollary 1. The only critical lattices for 
— \<X^X2<k, 

are those of type (ii), where now t is allowed also to take the value 1. 

For the other lattices of type (i) have a point on — \<x^X 2 <D, 

I Xj -f- Xj I = Z). Here our usage differs from that of Mahler (1946 a), 
since he calls a lattice admissible for a set ST if it has no points other 
than o in the interior of y. Thus Mahler calls the lattice of type (i) 
admissible (and so critical) for the set of the corollary. 

Lemma 10 may be regarded as a more precise version of Theorem IV 
of Chapter II. To make the connection more clear we prove 

Corollary 2. If k is an integer, the critical lattices of type (ii) are 
admissible for 

— 1< XjX2< k. 

For the general point of a lattice of type (ii) is 

X = -f Mjr, 

where Mj , Mj integers. Then 

X 1 X 2 = («i^i + « 2 A) i^lp 2 + “ 2 ^ 2 ) = -u\+kUiU 2 + ku\. 

We showed in §4.4 of Chapter II that —u^+ku^u^ + kuX does not 
take any values strictly between — 1 and -f k when ^ is a positive 
integer and Mj , are integers not both 0. 

V.7. Reducibility. It may happen that if ST^ is a star-body, there is 
some star-body ST^ which is properly contained in ST^ but which has 
the same lattice constant: —A{£T(). We say then that ST^ is 

reducible. If no such ST^ exists, then .5^ is said to be irreducible. Criteria 
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for the reducibility of a bounded star-body have been given by Mahler 
(1946a) and Rogers (1947a). Later, Rogers (1952a) gave a most 
ingenious example of a reducible star-body which does not contain an 
irreducible star-body of the same lattice constant : but he was able to show 
that if a rather wider class of point sets, which he calls “star sets’’, 
is considered, then every bounded reducible star set contains an irreducible 
star set. Convex 2-dimensional sets were considered in great detail by 
Mahler (1947a). Mrs. Ollerenshaw (1953b) has shown that the 
M-dimensional unit cube is irreducible for all n and that the unit sphere 
is irreducible at least for 5- She shows further that a 3-dimensional 
cylinder is irreducible if its 2-dimensional base is irreducible. 

We refer the reader to the papers quoted for the general theory. 
The following lemma shows in a simple case the sort of ideas involved 
in the proof that a star-body is irreducible. 

Lemma 11. The star-body 
is irreducible. 

For suppose .5^ is a star-body strictly contained in 2 . Then there 
is a point p on the boundary of S which is not on the boundary of 6 ^. 
But now (§ 6.4 of Chapter III) there is a critical lattice A of ^ having 
points at The only other points of A on the boundary of ^ are 

the points ±q, ±r which, together with are at the vertices of a 
regular hexagon. Since the lattice A must be admissible for 6T. 

But now the only points of A on the boundary of can be ± qr and r. 
These points clearly do not satisfy the criterion of Theorem VII, Corollary. 
Hence A is not critical for 6^, that is 

A( 6 ^)<d(A) =zl(^). 

Since 6 ^ is any star-body contained in this proves the lemma. 

A similar proof shows that Mullineux’s star-body ^ defined in 
Lemma 10 is irreducible. Again, if p is a point on the boundary of iT’ 
then, apart from a finite number of exceptional p, there is a critical 
lattice for 6 ^ which has only three pairs of points ±p, and ±r 
on the boundary of ; and the points ± 9. ± cannot be the only 
points on the boundary of a critical lattice of any set ^ contained in 
The finite number of exceptional points p for which such a lattice does 
not exist cannot affect the argument, since if .T is properly contained 
in there are infinitely many boundary points of 6 ^ which are not 
boundary points of S'. 

V.7.2. If S’ is an unbounded star set but there is a bounded star 
set S' contained in S’ such that A(S') =A{S), ihtn S’ is said to be 
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boundedly reducible. Corollary 2 of Lemma 10 shows that the 2-dimen- 
sional star-body 

—\<x^x^<k ( 1 ) 

is boundedly reducible when ^ is a positive integer, since zl ( .5'’*) = 
where is Mullineux’s set 

—i<XiX2<k, \xi-\- x^\<{k^ Ak)^. (2) 

On the other hand, is not boundedly reducible for every k. 
Thus we saw in § 4-4 of Chapter II that the critical lattices M for 
are admissible for |«i^ 2 |<||, and so have no points on XiX^= — \. 
But then, precisely as in the proof of Lemma 10, M cannot be critical 
for a bounded set ^ contained in since the lattice of points 

{{i - e)xi + £X2, + (1 - e) Afj}, (x^,X2)eM 

would be admissible for ^ for sufficiently small e. 

The proof of Theorem VII of Chapter III shows that the 2-dimen- 
sional star-body 

\xl+xl\ <1 

is boundedly reducible, since the proof used only a bounded portion 
of the set. Mahler (1946a) has developed criteria for sets of certain 
types to be boundedly reducible if their critical lattices are known. 
Bounded reducibihty is further discussed by Davenport and Rogers 
(1950a). Davenport and Rogers introduce the concept of full redu- 
cibihty. If is a set contained in the set 6^ and A(^) =A{S^) then 
clearly every lattice critical for is also critical for but in general ^ 
might have more critical lattices. For example when ^ is a positive 
integer the sets defined in (1) and (2) have the same lattice constant, 
but the critical lattices of of the type (i) of the enunciation of 
Lemma 10 will in general have points in 5^. On the other hand, the set 

—\<x^X2<k, -f- ^ (^* -f 4^)^ 

has no more critical lattices then by Lemma 10 Corollary 1. If an 
unbounded set contains a bounded set .^with the same lattice constant 
and no more critical lattices then y is said by Davenport and Rogers 
(1950a) to be fully reducibleL They, following Mahler, use the concept to 
show that lattices of certain types have infinitely many points in certain 
regions. We shall be discussing this from a rather different point of 
view later in Chapter X. We do not discuss bounded and full reducibihty 

^ Their definition is not quite the same as ours since they use Mahler’s defini- 
tion of an admissible lattice. But it is not difficult to see that it is equivalent 
to ours. 
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further but refer the reader to the papers quoted. The following example 
illustrates the connection with the existence of infinitely many lattice 
points in sets. 

Lemma 12. Let k he a positive integer and A a lattice with 
d(^)^{k‘^+Ak)K 

Then there are infinitely many points of A in 

(3) 

There are infinitely many points of A in 

(4) 

except when A is critical for 

If A contains a point (0, x.^ with it contains all the points 

(0,rx2) (r = 1, 2, 3. •• •) and so the lemma is trivially true. Otherwise it 
suffices to show that for every e>0 there is a point (%. x^) of A in 
for which | % | ^ e ; and that this point is in .5^ unless A is critical for . 
Let t be any positive number. Then the lattice A, of points 

{x„x2)={tX,.t-^X2) {X„X2)eA (5) 

has the same determinant as A. Hence by Lemma 10, Corollary 1 
there is a point of A< in 

— i-^XiX2'^k, + ;t2| = (^^ + 4^)^; (6) 

and indeed in unless A; is critical for 5^. But now the region (6) 
is bounded, so all the points of (6) satisfy 

kil ^7 

for some number y which depends only on k. Hence, by (5), the original 
lattice A contains a point (X^ , Xj) =b o such that 

~i^X^X2^k, iXil^yTL 

Further, A is critical for if and only if A, is. Since yt~^ is arbitrarily 
small when t is a arbitrarily large, this proves the result. 

V.8. Convex bodies. For convex bodies stronger results than Theo- 
rem VII hold about the lattice points of a critical lattice on the boundary. 
The following theorem of Swinnerton-Dyer (1953 a) generalised an old 
result of Korkine and Zolotareff for spheres. 

Theorem VIH. Let XT he a bounded open symmetric convex set in 
n dimensions and let A be a critical lattice for Jf. Then A has at least 
1) pairs of points the boundary of 
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We reproduce Swinnerton-Dyer’s elegant proof. Let b^, ...,b„he 
a basis for A and let A' be a lattice with the basis 6' where 

b'j-bj=ri Z (1) 

IgiSn 

and the a^j and rj are real numbers to be determined later. Let 
±Pi, ±Pn be the only points of A on the boundary of JT and let 

iPi. ±Piv be the points of A' which correspond to them in an 

obvious way. Let tTi , . . . , ir^^ be tac-planes to Jf at p^, (Theo- 

rem IV of Chapter IV). If there is more than one tac-plane, we choose 
one arbitrarily. We then impose on A' the condition that p'j lies in tTj 
tor By (1), and since pj hes on iTy, this imposes a condition 

of the type 

2a,,.4f> = 0 (1g/^iV), (2) 

where the numbers depend only on the point p j and the choice of 
tac-plane tTj. We also impose the conditions 

( 3 ) 

The total number of linear conditions (2) and (3) imposed on the 
numbers is — 1) -|-iV. Hence if W<^m(«-|- 1), there exists a 
set of real numbers not aU 0 satisfying (2) and (3). We select any 
one such solution and keep it fixed in what follows. 

Since the points Pj he on tac-planes to the open set JT, they do not 
lie in Jf. When |»;| is small enough, there are no further points of A' 
in JT other than o, by the argument of § 6.1. Hence A' is admissible 
for jT. Since A is critical, we must then have 



i (A') = 1 det (b; b;) I ^ I det (b„ . . . , b„) I = d ( A) = zl (.;r) ; 

that is 

/l-faiijj a^^rj a^„rj 

l^detf I+« 22 »? a^nV 



«nl»? «n2»? ^+‘^nnvl 

= 1 + Av + + (say) . 

Since this must be true for all sufficiently small values of | ?; j , it follows 
that 

=Z = 0 

i 

and 

Ai=-Z + E ^ 0. 



*<; 



»</ 
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Hence on using the symmetry conditions ( 3 ) we have 
0^2A^-Al = - Z 

lS»Sn 

ISjSn 

Hence aij=0 for all i and /; which is a contradiction. The contradiction 
arises from the assumption that there are fewer than |n(« + t) pairs 
of points of A on the boundary of Jf. Hence the theorem is established. 

V.8.2. For bounded symmetric convex star sets the considerations 
of § 6.3 about the maximum number of points of a critical lattice on 
the boundary and about their index may be made much more precise, 
as was shown already by Minkowski. His results apply indeed not 
merely to critical but to all admissible lattices. We recollect that a 
body JT is strictly convex if every point tp + {\ — t)q (0<<<!) is an 
interior point of Jf whenever p and q are distinct points in or on the 
boundary of JT. 

Theorem IX. Let A be an admissible lattice for the convex symmetric 
open set Ot. Then there are at most -2 ( 3 "— 1) pairs of points ±a of A 
on the boundary of Jf. If is strictly convex, the number of pairs is at 
most 2 "— t. 

The proofs are very simple. Suppose first that Jf is strictly convex. 
Let , . . . , 5„ be any basis for A and let 



a = hM„b„ 

be a point of A on the boundary of Jf. Then not all of , . . . , u„ are 
even, since otherwise would belong to A; and |a is certainly an 
inner point of Jf. Let now 

a' = «;5i + +u'„b„, 

if possible, be another point of A on the boundary of such that' 
u'j = Uj (mod 2) (1 ^ / ^ «) . 

Then |(a+a')€A. By the strict convexity, |(a+a') is an inner point 
of Jf and so must be o, that is a' = — a. Hence the total number of 
boundary points is at most the number of residue classes for (m^ , . . . , m„) 
modulo 2 excluding (0, ..., 0), that is 2"— 1, as required. 

When K is not strictly convex one must work with congruences 
modulo 3; the details are left to the reader. 

Theorem X. Let of be a convex symmetric open n-dimensional set 
and A an admissible lattice for Of. If a^, are points of A on the 

1 The notation means that Uj — u'j is divisible by 2. 
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boundary of Jf then their index I satisfies 

I^n\. ( 1 ) 

There is inequality in (1) if is strictly convex. 

If Oj , . . . , a„ are linearly dependent, then their index is 0 and there 
is nothing to prove. Otherwise, every point c of A may be put in the 
shape 

( 2 ) 

where v^, . . . , v„ are rational numbers. The sets of numbers v such that 
(2) is in A clearly form a lattice M of determinant 

cf(M) = =/-i 

^ ’ |det(a^ o„)| 

Hence, by Minkowski’s convex body Theorem II of Chapter III, there 
is point u 4= o of M such that 



v,\+---+\v„\^(nlliyi\ ( 3 ) 



Let F be the distance function associated with St", so that 
F{af)=\ ( 1 ^/^ m ). 

For the eg A given by (2) and (3) we thus have by the convexity and 
symmetry of S^, that 

F(c) ^ |i)i|F(ai) ^ h lv„lF(a„) ^ («!//)>/”. (4) 

But F(c)^1 since A is admissible for St and so I^n\ as required. 
If I =n\ and St is strictly convex we should have F(c) < 1 unless both 

M is a critical lattice for It'll 4 hli^ni<f every point of M on 

the boundary has « — 1 of the co-ordinates v^, equal to 0. But 

these two requirements are incompatible by Swinnerton-Dyer’s 
Theorem VIII. 

The^ estimate for I in Theorem X can usually be much improved 
and more information obtained about the relationship of Oj , . . . , a„ to 
a basis for the lattice. Thus for « = 3 we have 

Corollary. If St is strictly convex and « =3. 7 = 1 or 2. If 

7 = 2, then a 2 -fa 3 )gA. 

For 7^5- If7 = 5, then there are integers u^,u^, % not all divisible 
by 5 such that 

c =|(«iai-|-M 2 a 2 -f M3 03 )gA. 



1 We do not use the rest of § 8.2 later but do refer to it at the end of § 8.5- 
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We may suppose that 5 does not divide % and, by taking 2 c instead 
of c if necessary, that 

Ml H= i 1 (mod 5 ) . 

Hence by adding appropriate integer multiples of «i, a^, Og to c we 
may suppose, without loss of generality, that 

« 1 =± 1 , 1 m 2 |^ 2 , \us\^ 2 . 

But then by the strict convexity we should have 

F{c) < iF(ai) + 1 F(a,) + iF{a,) = i ; 

a contradiction. Hence 74 = 5 - Similarly 7 4= 3 - 

Suppose now 7 = 4 . Then there exists a base bj, b2, 63 for A such 
that 

®2 = Ujibi -p 

®3 ~ ^ 31^1 "b 1^32 ^2 "i" 1^33 ^ 3 < 

where 

and 

i'iiV22i^33 = 4 . 

Then Hn = t, since otherwise ^OjeA andF(^ai)<F(ai) =t. If 

then either or |(ai + a2) is in A; and again we have a contradiction. 

Hence 

I'll = 1^22 = 1; so 1^33 = 4. 

If Vgi were even, we should have either ^ «3 or ^(02 + 03) in A; so D31 
is odd. Similarly, V32 is odd. Hence there is a point 

C =i(MiOi + M2«2 + « 3 )eA, 

where Mj , «2 are odd. By adding integer multiples of and Og to c, 
we may suppose that Mi=T:t, M2=J;1. But then 

F(c) < i{F(ai) + F(o2) + F(o3)} = | < 1 . 

Hence 74 = 4 . 

Finally, when 7 = 2 it follows, just as for 7 = 4 , that the only pos- 
sibility is i'ii=ii22=l, ^21=0 and ^33=2. Further, the argument that 
V3 i,V 32 are both odd continues to hold. Hence ^(ai4-a2+®3)^^- 

V.8.3. When jT is a bounded symmetrical strictly convex 2-dimen- 
sional set, the lower bound 3 for the number of pairs of points ±0 of 
a critical lattice on the boundary given by Theorem VIII coincides with 
the upper bound give by Theorem IX. We have indeed 
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Theorem XI. A. LetCf he an open convex symmetrical 2-dimensional 
convex body. Then a critical lattice f\oj has six points ±9, rtf' 
on the boundary of JT such that 

p + q+r = o (i) 

and any two of p, q, r is a basis for A. 

B. Further, if ±g, ±r are any points on the boundary of X 
such that (1) holds, then the lattice M with basis p, q is admissible for Ct. 
There are no further points of M on the boundary, except when JT is a 
parallelogram and two of p, q, r are mid-points of its sides. 

The first part of Theorem XI is an almost immediate consequence 
of the last three theorems. By Theorem VIII there are three pairs of 
points ±p, ±q, d;r on the boundary of Jf. By Theorem IX, the index 
of p,q is 1 or 2. Since ^p, ^q are (inner) points of Jf, they cannot 
belong to A. Hence, if the index is 2, the point + g) is in A. It is 
also in or on the boundary of Jf, the latter only if is not strictly 
convex. If the index is 2, we may thus take ^(p + g) =g' instead of q. 
The index of p and g' is f . Hence without loss of generality the index 
of p and g is 1. Hence r = up-\-vq for some integers u and v, where 
1 M I ^ 2, 1 1 ^ I ^ 2, since the indexes of p, r and of g, r are at most 2. Not 
both u and v can be even, since otherwise would be in A. If, say, 
u — ±2 is even, then v = ± 1 is odd, and r' = |(r + vg) =^«p + vg 
is in or on the boundary of JT. It must be on the boundary since A is 
admissible. Hence by taking r' instead of r we may suppose, without 
loss of generality, that | m | = ] v | = 1 . By changing the signs of p and g, 
where necessary, we may suppose that M=y = — t, that is, that (f) 
holds. This proves A. 

It remains to prove B. Suppose, if possible, that the point 

c = up + vg 
= iy — u)q u)r 

= {u — v)p v)r 

is in or on the boundary of Jf for some integers u,v. If, say, | m | > | v | + 1 , 
then the point 

p = u^^c — vw^q 

would be an inner point of Jf, because we should have | | + 1 | < 1 . 

Hence from the three expressions for c we deduce that 

|1«| - Ivll^l, 

\\u - 1^1 - 1m|| ^1, 
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It is easy to see that the only integral solutions of those inequalities 
giving primitive lattice points distinct from ±g, i*" are 

±{u,v) ={2.i), {i.2) or (1,-1). 

Hence after permuting p, q, r cyclically if need be, we may suppose 
that c=p — q is in or on the boundary of Jf. Since now 

p=jc-^r, g = 

the only possibility is that c is a boundary point. 

We now show that JT contains the whole parallelogram ^ of points 



with 

Indeed 



X = kp /iiq 

max{|A|,|/i|}<l. 



05 = pc + ar, 

where 

lei +kl +iU+/*| =max{|A|,l/^|}. 

But now the area V (^) of ^ is 

V{^) =4|det(p, g)| =4i(M). 

On the other hand, by Minkowski’s convex body theorem, we have 

V{jr)^4d(M). 



Since JT includes and since JT is open, the only possibility is that 
jT coincides with This concludes the proof of the theorem. 

Theorem XI gives one a ready criterion for finding the lattice constant 
of 2-dimensional convex star-bodies. It is easy to see that if p is a 
given point on the boundary of JT, then there is precisely one hexagon 
of boundary points ±p, ±g, for which (1) holds. The lattice 
constant of JT is then the lower bound of det [p, g) for these hexagons. 

V.8.4. As an application of Theorem XI we prove 

Lemma 13. Let ^ be a convex symmetric open hexagon. Then 

( 1 ) 

The only critical lattice M is that which has points at the mid-points of 
all the sides of ST. 

By Minkowski’s convex body theorem, 

A[ST)^\V[ST). 



Cassels, Geometry of Numbers 
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( 2 ) 
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Let the vertices of taken in counter-clockwise order be 
a, — b,c, ~ a,b, — c. 

Then the lattice M of the lemma has basis ^{a — b) and \[b — c). It 
clearly contains also \ (c — a). Hence, by Theorem XI, M is ,5'’-admis- 
sible. We now show that 

(3) 

On dissecting 6^ into triangles with a vertex at o, we have 

— V(£^) = det (a, b) -)- det (b, c) + det (c, a) = 4 det (m, v) 

on putting b =a + 2u, c=a + 2». This proves (3). Then (1) follows 
from (2) and (3) since M is ,5^- admissible. 

Now let A be any critical lattice for 6^. Then d{A) If 

A did not have a point on a particular side of there would be a sym- 
metric convex set larger than ^ which contained no point of except 
o; which would contradict Minkowski’s convex body theorem. Hence, 
by Theorem XI, A has precisely 6 points ±p, ±9. on the boundary 
of y’; one on each side. If, say, the points Trp are not the mid-points 
of their sides, then by rotating slightly the sides about ±p, leaving 
the other pairs of sides fixed, it would be possible to find a convex 
symmetric set of volume V(.f')> V(y’) containing no points of A 
except o; again contradicting Minkowski’s convex body theorem. 
Hence rtP, i9> i*" ^re the mid-points of their sides, and A = M. 

It would, of course, be possible directly to compute the determinants 
of all lattices having points p, q, r with p-f-g-br=o on the boundary 
of and to show that M gives a minimum. 

V.8.5. Minkowski (1904a) has extended the argument of Theorem XI to 
3 dimensions and proved the following. 

Theorem XII. To find the lattice constant of an open symmetrical convex 

set in 3 dimensions it suffices to consider the minimum of the determinants of 
lattices generated by three points Oj, Oj, O 3 on the boundary of A and satisfying one 
of the folloiving three conditions : 

(A) the points a^ — a^, a^ — a^, a are on the boundary of awd — Oj -f O 3 , 

outside JT. 

(B) the points Oj-faj, Oj-t-Oj, Oj-t-Oj are on the boundary of Jf and Oj-f Oj-t-Oj 
is outside JT. 

(C) the points Oj-f Oj, Oj-f Oj-f Oj and aj-|- Oj-)- Oj are on the boundary of JT. 

We refer the reader to the original paper for the proof. Alternatively the 
reader may construct a proof by combining the ideas of the proof of Theorem XI 
with those at the end of § 8.2. The corresponding result in 4-dimensional space, 
which is fairly complicated, has been found by K. H. Wolff (1954a), who states 
that some of the auxiliary results are due to E. Brunngraber (1944a). 
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Minkowski (1904a) used Theorem XII to find the lattice constant of the 
octahedron 

l^il + l^ni + kal < 1. 

namely I9/108. The lattice constants of further convex 3-dimensional bodies have 
been determined by Chalk (1950a) and Whitworth (1948a and 1951a). In all 
cases a considerable amount of rather tedious detail is necessary. 

V.9. Spheres. We now consider more particularly the «-dimensional 
spheres 

\xY=x\^ (1) 

We denote the lattice constant of by 

( 2 ) 

The value of is known for 1 ^«^ 8 , see Appendix A. We here find 
again 7^, which we already found in another context in Chapter II, 
Theorem III. From this the value of will follow almost at once by 
a general theorem of Mordell in Chapter X. 

We must first prove a result for spheres which is more precise than 
the mere application of Theorem X. 

Theorem XIII. Let t\ he a lattice admissible for 2>^\ |a;|^<l; and 
let aj, .... a„ he points of A on the boundary of Then the index I 
of a-^, ... ,a„ satisfies 

i^{d(A)}-^^{A{^„)y^ = r-\ (3) 

For |a^.| =1 (1 and so, by Hadamard’s Lemma 9 , we have 

|det(ai, ...,a„)l ^ |Oi| ...[aj = 1 . 

Since 

j ^ |det(Oj, ...,o„)| 

the first half of ( 3 ) follows. The second half of ( 3 ) is a trivial conse- 
quence of the definition of 7^. 

Corollary. If n = } the index is 0 or \. 

For has volume 4nj}, and so 

7^^ti/6>|, 

by Minkowski’s convex body Theorem II of Chapter III. 

Theorem XIV. 

n = 2-4. 

A critical lattice for 2^ has a basis such that 

I MitHi + «2m2 -f U^m^Y = “1 + «2 + «3 + «2«3 + “3«1 + “l 

identically in , Uo and . 
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Let A be a critical lattice for ^3. By Theorem VIII there are at 
least ^n(« + 1)=6 pairs of points ±m of A on the boundary of ^3 
and by Theorem VII there is a hnearly independent set of 3, say m^, 
By Theorem XIII, is a basis for A. If 

m = Ml nil + MjrUj + 

is another point of A on the boundary of 2 ^, the only possible value 
for the Uj are 0, ± 1 by Theorem XIII. There can be at most one such 
pair ±m with For if, say, 

m = MiWii + M^mj + M3»n3, «i 1^2% 4 = 0 , 

m' = M^mi + «2^2 + W3»n3, mJm2M3=(=0, 

the index — of is even, so must be 0. Similarly 

Mj M3 — M2 M3 = Ml M3 — Ml M3 = Mj M2 — Ml Mj = 0 , 

so m' =-±_m. Hence there must be at least one point Minii + M3 m3 

with Ml M2 M3 =0 on the boundary of ^3 other than ±mi, ±m2, img. 
We may suppose without loss of generahty that it is 

m^ = mi — m2 . 

Then neither mi + m2 nor mi+m2±m3 can occur as boundary points, 
since they would give index 2 with m3 and tn^. Hence at least two 
of the remaining possibihties 

nil ± m3, m2±m3, mi-m2±m3 

must occur. Since + and cannot both occur, 

we may suppose without loss of generahty that 



mg = m2 — m3 



occurs. Then and mi— m2— m3 do not occur, since they 

give index 2 with m2 and m^: and mi + m3 cannot occur, since it gives 
index 2 with and m^. Hence the only possibihties for dbw*s ^re 

m3 — iHi or mi — m2 + m3. 

In the second of these cases take m^ instead of m3. Then without loss 
of generahty 

mg = m 3 — iHi . 

Write 

/(Ml, M 2 , M 3 ) =|Mlml + M2»W2 + «3»W3|^ 

where u^,u^,u^ are variables, so /(u) is a quadratic form. Then 



/(t,0,0) =/(0,l, 0) =/(0,0,l) 

= /(1,0 ,-l) =/(0,l,-t) =/(l,-l,o) =t. 
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Hence 

/(«) = «1 + «2 + m| + «2«3 + «3«1 + “l“2 

with determinant and so 



{det(mi,m2,m3)}* = |, 

as required. 

V.9.2. Let y be a star-body and A an y-admissible lattice. We say that 
A is extreme for y if there is a neighbourhood 2 of A, in the sense of § 3.2, in which 
every y-admissible lattice M satisfies 

d(M) ^ d{h). 

Clearly a critical lattice is extreme; but an extreme lattice need not be critical. 
Some of the results proved already extend to extreme lattices, notably Swinnerton- 
Dyer's Theorem VIII. 

The extreme lattices of n-dimensional spheres have been exhaustively studied. 
For example there are six distinct types of extreme lattice for the 6-dimensional 
sphere as was shown by Barnes (1957b). There is a general theorem of Voronoi 
(1907 a) which helps to characterise the extreme lattices of an n-dimensional sphere 
(they are "perfect” and "eutactic”). Barnes (1957a) has given an extremely 
elegant proof of VoronoTs characterisation. Unfortunately we cannot discuss 
these points further here, so we refer the reader to the two papers by Barnes 
where there are further references to the copious literature. 

V.IO. Applications to diophantine approximation^. The theory of 
Diophantine approximation deals with the approximation of rational or 
irrational numbers by rational numbers with special properties. The 
geometry of numbers has many applications to Diophantine approxima- 
tion. The author’s recent Cambridge Tract [Cassels (1957a)] deals 
with Diophantine approximation and we do not intend to repeat what 
was done there. We give however a theorem of Davenport generalizing 
work of Furtwangler which is an interesting application of Mahler’s 
compactness techniques. 

First, we note an obvious consequence of Minkowski’s linear forms 
Theorem III of Chapter III. Let , . . . , be real numbers and Q an 
integer. By Theorem III of Chapter III there exist n -f 1 integers 
Uq, not all 0, such that 

(l^/^«), (1) 

( 2 ) 

since together with Mq form «-|-l linear forms in 

Mq, . . . , M„ with determinant 1. Were «o=0, we should have | m,| < 
so Mj = 0 Hence «o=t=0> replacing Mj, by 

1 Not used later in book. 
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— Mq, if need be, we may suppose that 



0<«o< Q- 

Further, (1) may be written 



• ■»/- 



< 



«„ QVn 



(2') 



(1') 



which shows that the are good rational approximations to the , 
all with the same denominator Mq . 

We may look at (1) and (2') from another point of view. On elimi- 
nating Q we have 

«o{maxJ«oi9y- (3) 



There are in fact infinitely many solutions Mq>0, Mj, ..., of (3). If 
all of i?i , . . . , are rational, this is trivial since then there exist integers 
Vj>0, Vj, such that 

and then we may put 

My = r Vj (0 ^ ^ m) , 

where r is any positive integer; and then the left-hand side of (3) is 0. 
Otherwise we may suppose that is irrational. Suppose that R integral 
solutions mW have already been found with m^'*>0. 

Since is irrational, we may choose Q so large that 

For this value of Q the solution of (t) and (2') gives a solution of (3) 
which is clearly not identical with any of the earlier ones. 

V.10.2. For different purposes one may be interested in different 
properties of the approximations My/M^ to the ^y . For example, instead of 

max I Mo??, — M, | 

we may wish to make 



2 -«;■)* 


(1) 


l^;^n 




or 




n l«0^, - 


(2) 






small. Or again one may be interested in 
of the type 


"asymmetric” inequalities, 


- ^ - «;■ ^ 


IIA 

IIA 




Applications to diophantine approximation 



167 



where and are positive numbers. All these different problems 
may be brought into one general shape. Let 0{xi , . . . , be a distance- 
function of n variables. How small can' 

Uq0 Wj ) • • • ) ^n) 

be made for infinitely many sets of integers Ug>0 and «j, We 

write 

D(0:'&i , = lim ini u„0’‘ — u^, . . . — u„) (4) 

1*0— >oo 

tto, u ^, integers 

and 

D(0) = sup (5) 



so that D{0) is the number we wish to estimate. 

The non-negative function F(x^,...,x„) of m + 1 real variables 
defined by 






x„,...,x„) 



x„0”{x^,...,x„) if ;to^0 
- x^0”(- x^. - x„) if Xa^O 



( 6 ) 



is a distance-function when 0 is a distance function of n variables: 
since it clearly has the three defining properties that it is non-negative, 
continuous and satisfies 



F(tx^,...,tx„) =tF{x„ x„) 



when t>0. By definition, F is symmetric: 

F{-x^,...,-x„) =F{Xg,...,x„). (7) 

It satisfies the identity 

F{t” Xg, X^, . . . , X„) = F{Xg X„) (8) 

for any i>0, since 

0(ri x^,...,t^ x„) = ri 0(% ,...,x„). 

As in § 4 of Chapter IV we write 

F»+i(A) 

'’<^1 =“ P - 3 IA )-’ 

where the supremum is over all (« -f-l) -dimensional lattices, so that 

d{F) ={d(^)}-L 

where 6^ is the (n + 1)-dimensional star-body 
y': F{xg,....x„)<\. 

Davenport’s result may now be put in the following shape. 



’ By is meant the n-th power of 0 . 
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Theorem XV. Let 0 and F be related as above. Then 

D{0)^d{F) (9) 

always. If 0{x)=O only for x=o, then 

D{0)=d{F). (10) 

The first part of Theorem XV is due essentially to Mahler and is 
related to the theory of automorphic bodies which we shall study in 
Chapter X. When D{0)=O, there is nothing to prove. Otherwise, 
let c be any positive number such that 

c<D{0). (11) 

Then, by the definition of D{0), there are real numbers and 

an integer such that 

«o («o«?i - «i . • ■ • . - M„) ^ c , (12) 

whenever u^, ...,u„ are integers and 

(13) 

In particular, are not all rational; and so there exists a 



number x>0 such 


that 






max 1 — M, 1 ^ ix > 0 

lS;gn' ' '' 


(14) 


for all integers , . 


. . , with 






0 < M() ^ Ug. 




Clearly 


X ^ ^ < 1 . 


(15) 


Let be the 


M -f 1 -dimensional lattice of points 






■ > ^«) = (*<0> ^o'^l %.•••> > 


(16) 


where u^,...,u„ run through all integers. Clearly 






<f(Mi) =1. 


(17) 


The function 






F^ (xq , . . . , 


, , cV(»+i) , 

x„) =max F(xo,...,xJ, max \xA 


(18) 



is clearly an (n + 1)-dimensional distance-function and 



F^(-x) =F^(x) 

by (7). We show now that 

Fi’‘+i(Mi)^c. 



(19) 

( 20 ) 
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Consider a point (16) of where, by (I9), we may suppose that Mq^O. 
If «o=0 but not all of «i, u„ are 0, then the second term of the 
outer maximum in (1 8) is 



cl/(«+l) 

n 



max M,- ^ 

ISiSn' '' 



cl/(«+l) 

X 






by (15). If 0<«o^t4> then the second term of the outer maximum 
in (18) is still by (14). U the first term of the outer 

maximum in (18) is by (12). Hence in any case, 

Fi(x)^c‘/<"+i> 



for all a: S Mi except o. This completes the proof of (20). 
For positive integers r = 1, 2, ... write more generally 



F^{xq, .. 

Then 


1 [77/ X cl/(»+l) 1 |1 

. . , x„) — msix r(Xa, , xj, max hr,- 

[ rx ^ . 


(21) 


and 


F{x) g F,{x) g Fi(x) 


(21') 




lim F^{x) =F{x) 


(22) 



uniformly in any bounded set of points x. We have the identity 

F^{xo,...,x„) =Fi{r”xg,r-^Xi,...,r-^x„), (23) 

by (8). 

Let M, be the lattice 



Clearly 

and 



M,: {r-”xo,rxj^,.,.,rx„), ®eMi. 
<f(M,)=i(Mi)=1 
F,»+i(M,) =Fi»+i(Mi)^c, 



(24) 

(25) 



by (17), (20) and (23). Consequently, by (21'), we have the weaker 
assertion 

Fi^+i (M,) ^00 (1 ^r<oo). ( 26 ) 



By (24), (26) and Theorem IV Corollary, there exists a convergent 
subsequence of the M,, say 

By (24) we have 



^f(N) =1. 



(27) 



Since (22) holds uniformly in any bounded set, we have 



F”+i (N) ^ lim sup (M, ) ^ c , 
*-►00 * • 



(28) 
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by (25) and Theorem II. Hence 



d (F) — sup 
A 



d(A) 



= d(N) 



^ c. 



Since c was any positive number smaller than D{0), this proves 
d{F)^D{0), the first part of Theorem XV. 

The second part of Theorem XV requires quite different techniques 
and uses the basis constructed in Theorem II of Chapter I. By the 
Corollary to Theorem VI, there is a lattice A with 



d{A) =1 

and 

F”+1(A) =S{F). 



(29) 

(30) 



We denote the (n + 1)-dimensional vector (vq, . . . , x„) in which = 1 
but the remaining co-ordinates are 0 by 



n-l ' 

e, = \0, . . . , 0, 1 , 0 , . . . , oj (0 ^ / ^ «) . 

By Theorem II of Chapter I, with e= ^ and «-f t for n, there exists, 
for all sufficiently large numbers N, a basis a^,a^, a„ of A such that 



Then 

where 

and 



\Oj — N Bj\ < (1 ^ ^ m ) . 

0:gt^n 

I A ■ — 1 1 ^ (1 ^ / ^ «) 



(31) 

(32) 

(33) 

(34) 



I 1 g ^n, O^i^n, * 4= /) . 

Since 0 ^, Oj, are linearly independent, there are real numbers 

Ao, Aj, ..., A„ such that 

®o = ■^0 ®o + ■^1 ®i + ■ ■ • + A„ a„ , 
where we may suppose that 

Ao^O, 



on taking — o„ for if necessary. Since d(A) = i, we have now 
Ao = Ao|det (a„,...,a„)\ 

= |det(eo,ai,...,a„)| 

= N"{\ +0{N-i)}, 

where the constant implied by the 0 depends only on n. We may 
thus write 

®0 “ ^0 1^1 ®1 T * * ' T l^n » 



(35) 
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where are certain real numbers, and 

^=Ao-i = iV-»{i+ 0 (iV-i)}. ( 36 ) 

Let (5' be any number such that 

d'<d[F). 

We wish to show that 

lim inf 0 ” , . . . , Mjt?,, — «„) > d' (3 7 ) 

Uo -»-00 

for the t?i, we have just constructed; provided that N is greater 

than some Ng which may depend on S' and the function 0. After the 
first part of Theorem XV, this will complete the proof of the theorem. 
If (5 (F) = 0 there is nothing to prove. Otherwise we may suppose without 
loss of generality that 

0<d'<d(F). (37') 

To prove (37) we may clearly confine attention to integers m#, 
if any, for which 

«o>0, u,0”{y„...,y„)^d{F), (38) 

where we have put 

y,- = (1 ^/^m). (39) 

So far we have not used the fact that 0(ac) = 0 only for x=o. 
By Lemma 2 of Chapter IV, this implies that 

0(ac) ^ c|a;| ^ cmax[|;ri|, ..., lx„|j 

or some c> 0 . Hence, by ( 38 ), we have 

Mq max c“'‘ 6 (F) . (40) 

We now consider the point 

F = Motto- Ml Oj M„a„ 

of A. By ( 35 ) and (39) this is of the shape 

Y=ixu^e^F 2 y,«/; 

and so, by ( 32 ), has co-ordinates (Yq, . . . , Y„), where 
Yo=A«o + A^ Z Jjtja, 
y.= N z yjiii (i^f^«). 



( 41 ) 

(42) 
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Let e be an arbitrarily small positive number to be determined later. 
By (33), ( 34 ) and Lemma 3 , the inequality 

(1 +e) ^(yt y«) 

holds for all real numbers yj , ■ ■ • , y„ whatsoever, provided that N is 
greater than a number depending only on the number e and the func- 
tion 0. Hence, by (42), 

0{Y„....Y„)^{\+e)N0[y,.....y„). ( 43 ) 

By (40) and (41), we have 

0 < ytt (1 4- e) Mfl (all «o^ ^>) (44) 

for some which will depend, of course, on N. But now FeA and 
F'*+*(A) =d{F), by hypothesis. Hence 

d{F)^Y,0’'{Y„....Y„). (45) 

by the definition ( 6 ) of F. From ( 36 ), (37'). (43) and (44), we have 

uo0’'{y„ ....y„)^{Ny)-^{i + e)-'-^d{F)> d' (aU u,^U,), 

provided that first e is chosen small enough, then N is chosen large 
enough, and finally U^is chosen large enough. This concludes the proof 
of ( 37 ), and so of the theorem. 

V.10.3. The condition that 0{x) =0 only for a: =0 is necessary for 
the second part of Theorem XV. The case when n = 2 and 

0^ix^,x^) =\xiXi\ 

represents a fascinating problem of Littlewood. It is not in fact 
known whether there exist numbers and such that 

lim inf — uA Uoi^z — “al > 0, 

t4,— ► 00 

where «o, «i. «2 are integers. The corresponding function F{Xf), x^, x^) 
is given by 

F^(x„,x^,Xi) = \x^XiXi\: 

and for this we have Davenport's result that 

<5(F) = 1/7, 

which we shall prove in Chapter X. But it follows from work of Cassels 
and Swinnerton-Dyer (1955 a) and from Davenport’s results about 
the successive minima of F, that at least 

D(0)g 1/9-1. 
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There is a companion result to Theorem XV, also due to Davenport, 
which relates to the approximation of a single linear form to 0. Here 
one is concerned with 



= hminf |Mo + “ii?id 

max 

Ug, integers 



where the condition hw»^n = 0 clearly be omitted 

if 0 is symmetric. Then Theorem XV remains vahd if D{0) is replaced 

D'(<Z>) = supD'(0:^i,...,t?„); 



and the proof is substantially similar. 



V.10.4. Note that we have not shown the existence in the second 
part of Theorem XV of , ...,#„ such that 

lim inf - Mj , . . . , - «„) = <5 (F) : 

and indeed in general such . . . , do not exist^. When n = \, however, 
a does exist, as is easy to show. Here, of course, the only possibility 
for the distance function 0{x^ of one variable is 



0{X-y) = 



k Xj 
— f Xi 



if 

if 



0 

Xi^O, 



where k and I are positive constants. As in the proof of the second 
part of Theorem XV, we consider a lattice A with 



Let 



i(A)=t, F2(A)=<5(F). 

a = («o-«i). b = (bo,bi) 



be a basis for A, where without loss of generality 



Put 



f»i> 0 = 



( 1 ) 

( 2 ) 



After Theorem XV it is enough to show that 
lim inf 

►OO 

As in the proof of Theorem XV, it is enough to consider value of Mq 
and «] , such that 

'«o|«o^ + Mii^c-i*(5(F), (3) 

where c is a constant such that 0[x-^'^c\x^\ for all Xj. 

1 For example when n = 2 and ^ (Xj , Xj) = x\ + x\, as one may show by 
“isolation” techniques. Cf. Chapter X. 
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We consider now the point 

Y = Uoa +«i6 = (Moflo + Mj6o, Uoai + u^b^) = (Yo, Y^) 
of A. By (1) and (2), we have 

0(Yi) = bi 0{u^'& + Ml) . (4) 

But now, by (3), we have 

lim i = lim (a„ -Yb^^\=a^— b^-& = (5) 

t<0— >CO Wq \ ^0/ 

by (1) and (2). But 

Yo0(Yi)Sd(F); 

and so 

lim inf 0{u^& + Mj) ^ (5 (F) 

by (4) and (5). 

In particular, Theorem IV of Chapter II shows that 
lim inf MoImqJ? + Mi| ^ 5“^ 

for all &: and there exist numbers •& for which the sign of equality is 
required. Indeed the “successive minima” of Theorem IV of Chapter II 
correspond to a sequence of successive minima here. The original proofs 
of this used continued fractions, but there is a proof due to C. A. Rogers 
which uses the isolation techniques which will be discussed in Chapter X 
and which is given in the author’s Tract (Cassels 1957a). 

V.10.5. The proof of Theorem XV gives a simple case when in- 
equality necessarily occurs in Theorem II, that is, when we have a 
convergent sequence of lattices, 

and a distance function F such that 

F(M') > lim supF(M,.) . 

f— >CO 

Let F be the distance-function and M, the lattices occurring in the first 
half of the proof. Then 

F(M,) = 0 

for all r, since M, has points with x^^O. On the other hand, we con- 
structed a convergent subsequence of the M, such that 

where 

F”+»(N)^P(0:i?i,...,^„). 

The right-hand side here may well be strictly positive, as § 10.4 shows. 




Introduction 



175 



Chapter VI 

The theorem of Minkowski-Hlawka 

VI. 1. Introduction. Hitherto we have been primarily concerned to 
estimate the lattice constant A(y’) of a set from below, that is to 
find numbers such that every lattice A with tf(A)<d„ certainly has 
points other than o in In this chapter we are concerned with estimates 
for A{S^) from above; that is we wish to find numbers zlj such that 
there are certainly lattices A with d{A) =A^ which have no points other 
than the origin in i.e. are .V’-admissible. 

Hlawka (1944 a) showed that if is any bounded «-dimensional 
set with a volume (content) V in the sense of Jordan* and if A^>V, 
then there is a lattice A with d(A) =A^ which is admissible for y. He 
showed, further, that if .5^ is a bounded symmetric star-body, then it 
is enough that 

4>F/2C(n), (1) 

where 

C(n) =1 +2-” + 3--b---: (2) 

thereby confirming a conjecture of Minkowski. These results were put 
in a wider setting by Siegel (1945a). Denote by N^{A) =N(A) the 
number of points of A other than o in a set y; and by P^(A) =■ P{A) 
the number of primitive^ points of A in y. Siegel^ gave a very natural 
way to define averages over the .set of all lattices A with a fixed deter- 
minant d(A) =A^. If f(A) is any function of a lattice A, let us denote 
this average by 

SK{vi(A)}. (3) 

A 

Siegel showed that 

W{Ny{A)} = V(y)IA„ (4) 

A 

and 

W{PAr^)} = V{y)IC{n)A„ (5) 

A 

where y is any bounded set, not necessarily a star-body and not 
necessarily convex, which possesses a volume V{y) in Jordan’s sense. 

* This is rather more restrictive than the sense of Lebesgue, but if the volume 
is defined in the sense of Jordan it is also defined in that of Lebesgue and equal 
to it. Let ;((x) be the characteristic function of y, that is ;j(a:) = 1 if x£S and 
X (x) = 0 otherwise. Then y has a volume in the sense of J ordan if x {x) is integrable 
in the sense of Riemann, and the volume is equal to the integral of x{x) over all 
space. 

^ That is points agA which are not of the form a = Tib, where 5 gA and 
A > 1 is an integer. 

^ For a particularly simple exposition of Siegel’s averaging process, see 
Macbeath and Rogers (1958a). 
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Hlawka’s theorems follow at once from (4) and (5). If dj >F(y’), then, 

from the definition of the average, there must certainly by (4) be at 

least one lattice, say M, such that iVy>(M) ^ 3K (A^(A)) < 1. Since 

A 

is an integer, we must have =0, so M is .F’-admissible. 

Similarly, if 5^ is a symmetric star-body and /li>V(S^)/2C(n), then 
there must be some lattice N for which J^(N)<2. Since is sym- 
metric, points of N, other than the origin, occur in pairs, d;«. so 
i^(N)=0. Hence 5^ contains no primitive points of N and, being a 
star-body, can contain no points of N at all other than o. 

The constant C(«) occurs in (5), roughly speaking, because the 
probabihty that a point of a lattice A chosen at random should be 
primitive is («)}“^. More precisely, the ratio of the number of primitive 
points of A to the total number of points of A in a large sphere | a; | < /? 
tends to («)}"^ as R^<x>. 

When y is convex, improvements of the Minkowski-Hlawka theo- 
rem were obtained fairly soon after the original proof [see e.g. Mahler 
(1947b), Davenport and Rogers (1947a) and Lekkerkerker (1957a)]. 
However, even so, the smallest value of 

( 6 ) 

is not known even for 2-dimensional symmetric convex sets: though 
the same conjecture was made independently by Reinhardt (1934a) 
and Mahler (1947c) that it is attained when is a certain “smoothed 
octagon”, that is an octagon in which the corners are replaced by certain 
hyperbolic arcs. 

Mrs. Ollerenshaw (1953 a) has given an example of a 2-dimensional 
non-convex symmetric star-body for which Q{S^) is smaller than for 
the Reinhardt-Mahler convex octagon and constructed from it a set 
which is not a star-body for which 

(?= 1.3173.... 

It is not known whether this is the smallest possible value for a 2-dimen- 
sional set. 

For a long time no improvement was obtained on the Minkowski- 
Hlawka theorem for general sets or for star-bodies. However, almost 
simultaneously, improvements were made by Rogers (1955 a, 1955 b and 
1956a) and Schmidt (1956a and 1956b). Rogers’s work depends on 
elaborate estimates of the average 

m[{Ny(A)f] 

A 



( 7 ) 
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for positive integers k, where we have used the same notation as in (4). 
In a later paper Rogers (1958 a), using ideas of Schmidt combined 
with his own, shows that there is an absolute constant C such that 

^ ^ ^ I « log I - 2 log « - C (8) 

for all symmetric sets^, provided that the dimension n is greater than 
some absolute constant n^. We shall not discuss Rogers’s work further 
but refer the reader to the original memoires. Schmidt, on the other 
hand, uses an elegant device which is more effective than Rogers’s 
method for small dimensions but much less effective when the dimension 
is large. We shall discuss it more in detail in § 4. 

The work just described can be generalized in several directions. 
In the first place, instead of operating with the number Ny(A) defined 
above, one may consider more generally 

2 /(a), (9) 

a€ A 
+ o 

where f{x) is some function defined at all points of space and which 
may be subjected to certain conditions (e.g. that it be non-negative or 
Riemann-integrable). If f(x) is the characteristic function of then 
the sum (9) is just iV^(A). Again, one may confine the sum in (9) to 
primitive points of A, when there is an analogue of Py{A). In fact 
most of the work so far described has dealt with generahsations of this 
kind. Again, it was shown by Macbeath and Rogers (1955 a) that the 
Minkowski-Hlawka theorem extends to more general sets of points than 
lattices. It is enough for A to be any set of points such that the ratio 
of the number of points of the set A in the sphere | ac| < i? to the volume 
of the sphere should tend to a finite non-zero limit d &s oo. Indeed 
(4) continues to hold with a modified definition of the mean SR and with 
A^-=d-\ 

Finally, we observe that Mahler’s Theorem V Corollary of Chap- 
ter III often permits the results of this chapter to be extended to un- 
bounded sets on taking to be the set of points of in the sphere 
|a!|<r. 

VI.1.2. In this book we shall not consider any of these generaliza- 
tions in detail. In § 3 we shall prove the Minkowski-Hlawka Theorem 
in its original formulation, that is, the existence of a lattice A admissible 
for a sjmmetric star-body £/’ with finite volume V[6^) and with deter- 
minant arbitrarily near to V{6^). We shall use an averaging argument, 
but the type of average will be chosen to facilitate the proof, not for 

^ Professor Rogers tells me that Dr. Schmidt has obtained an improvement 
of (8) which is in course of publication in Acta Mathematica. 

Cassels, Geometry of Numbers 
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any deeper reason’. Then in § 4 we shall give an improvement of the 
Minkowski-Hlawka theorem using Schmidt’s ideas but not carrying 
the detail quite so far as he does. 

The arguments of §§3.4 depend on a thorough investigation of the 
properties of sublattices of prime index in a lattice and this is carried 
out in § 2. These investigations further enable one to prove the result 
conjectured by Rogers that if y” is a symmetric star-body and 
md{f\)<A(^) for some integer m and some lattice A, then contains 
at least m pairs of points ±oeA other than o. This we do in § 5- 

In § 6 we give an entirely different generalization of the Minkowski- 
Hlawka Theorem which applies only in 2 dimensions. We show namely 
that certain sets ^ of infinite volume (= area) are of finite type, that 
is, possess admissible lattices. The proof depends on a generalization 
of a theorem of Marshall Hall (1947a) due to the author (Cassels 
1956a). 

We do not use the contents of this chapter later in the book. 



VI. 2. Sublattices of prime index. An important tool in the work 
of both Rogers and Schmidt is the existence of sublattices of a given 
lattice with certain special properties. We shall use the definition and 
properties of an index introduced in Chapter I. 



Lemma 1 . Let p be a prime number and A an n-dimensional lattice. 
Let Oj, ..., be any points of A which are not of the shape pa, ae A 

and let ki, kf, be real numbers. Then there is a lattice M of index p 

in A such that 






pn-l_ 1 

p”- 1 



2 



( 1 ) 



Let bj , . . . , b„ be a basis for A. Let Cj , . . . , be integers and 



0<Cj<p (i^i ^n), ( 2 ) 

(Ci , . . . , c„) (0, . . . , 0) . (3) 



Let M(ci,...,c„) be the lattice of points MibiH hw„b„, where 

are integers, such that 

MjCi-f ••• -f M„C„= 0 (p). 



Clearly M(ci, ..., c„) is of index p. There are p” — \ such lattices and 
we now show that a point a, belongs to precisely p"~^ — 1 of them. 



’ Other averaging processes have been used. For a particularly brief proof of 
Theorem II using one of them, see Cassels (1953a). It has been shown by Rogers 
(1955a) that many of the averaging processes that can be used to prove the 
Minkowski-Hlawka Theorem are essentially equivalent to Siegel's. 
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We have 



where are integers not all divisible by p, by hypothesis. 

Without loss of generality, v,-^ is not divisible by p. The congruence 



+ ■■■ +'^,nCn=Q {P) 



(4) 



then determines q uniquely if are given subject to (2). In 

particular, (4) gives Cj = 0 if already Cj=---= .=0; contrary to (3). 
But Cj, . . . , c„ may be given any other of the p ' ^ — \ possible sets of 
values s;;bject to (2). Hence the average of th'; left-hand side of (1) 

over all lattices M = M (q c„) is given by the right-hand side, and 

so (1) must be true for at least one of them. 

We have at once the 



Corollary 1 . Let p be a prime number and let a^, ...,ap be p points 
of A none of which is of the shaps ph, b€A. Then th te is a lattice M 
of index p in A which contains none of a^, ... ,ttp. 



For we may put k^ = \ for \ ^r^p. For the lattice M of the theorem 
we have 






pn~\- 1 
p"- 1 



p <\. 



The number p of points in the corollary cannot be replaced hy p 
It is easy to see that if o^, Oj are any two points of A, then at least 
one of the p points 



Oj, Cj-l-rai — 1) 



is in each sublattice of index p. 

More generally we have the following corollary, due to Schmidt in 
essence. 

Corollary 2. Suppose that the number R of points a, satisfies 

p-\ 

for some integer m. Then there is a lattice M of index p in A such that 



o,eM 



pm-\ _ 1 



pm_ I 






(5) 



(i.e. n in ( 1 ) may be replaced by m). 

If the dimension n of the space is g w the result follows at once since 



1 

P ”'- 1 



> 



pn-\ _ 

p"- 1 



if m> n. 



12* 
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When n>mwe use induction on the dimension n. We say that two 
vectors a and a' of A, neither of the shape pb, bc\ are proportional 
mod p if there is an integer u and a vector c of A such that 

a = ua' + pc. (6) 

Clearly u is prime to p. The relationship is a symmetric one between 
a and a', since there is an integer v such that uv = \[p)\ and then 

va — a' -\- pc' 

or some c' c A. Further, if a proportional both to o' and a", then o' 
s proportional to a". We thus have a subdivision into classes or "rays”. 
The number of rays is clearly 

p”- 1 
^-1 ■ 

Since we are now supposing that n>m, at least one of these rays 
must contain no members of the set o, (1 If c is in this ray, 

it is of the shape c = wb where 6 is primitive and w is an integer prinje 
to p. Hence the primitive point 5 is in the ray, and we may suppose 
that 5 = 5i , where , . . . , is a basis for A. Then every point is 
of the shape 

®r = ^fl^l H + ^’rn^n> 

where by the construction of bj , at least one of i ;,2 - • • • • *'f n is not divisible 
by p. Hence if we make a, correspond to the vector 

in the (n — 1) -dimensional lattice Aq of points with integer coordinates, 
then o, is not of the shape pb, b g Aq . Since we are assuming that the 
corollary has already been proved for smaller values of n, there exist 
integers Cj, . . . , c„ such that 

2 2 

c,»„ 0 if) 

The lattice M of points 

«lbi -1 + “n^n 

with 

C2M2 + --- +C„«,= 0 {p) 

then does what is required. 

VI. 2. 2. A refinement of the argument gives a rather more special 
result than Lemma i in which now the must be non-negative. 

Lemma 2. Let p he a prime-number and A an n-dimensional lattice. 
Let Op, ..., Oj; he any points of A which are not of the shape pb, 
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6eA and let be non-negative real numbers. Then there is a 

lattice M of index p in A such that 

Oo€M 

and 

(1) 

0,6 M l^rSR 

We may choose a basis , . . . , for A such that 

®0 ~ > 

where is some integer, which is not divisible by p by hypothesis. 
For integers Cy with 

0<Cj<p (2^j^n), (2) 

denote by N (c^ , . . . , Cp) the lattice of points 

«i6i + -- - +m„6„, 

where the integers satisfy 

Mi + C 2«2 + -.-- +c„«„ = 0 (/>). (3) 

Clearly Oo€ N (c 2 , . . . , Cy,) . 

For \ ^r^R, let 

= V'VrnK- 

By hypothesis, not all of the integers v,i, are divisible by p. 

If all of v, 2 , ■■■, i>rn are divisible by p, then is not divisible by p\ 
and so a, does not belong to any N (c 2 , . . . , c„). If, say, v,^ is not divisible 
by p, the condition 

^i + C2i',2H = 0 ip) 

is satisfied for precisely one value of Cg if Cj, ..., are fixed; that is 
a, belongs to precisely p"~^ of the p"~^ lattices N(cj, Hence 

if M runs through aU the P"~^ lattices N (^ 2 , • • • , c») the average value 
of the left-hand side of (1) is 

p-^E'K, 

where E' denotes that the r for which v, 2 . i'r» are all divisible by p 
must be omitted. Since for all r, by hypothesis, this shows that 
at least one of the lattices M = N(c 2 , ..., c„) satisfies (1). 

VI. 3. The Minkowski-HIawka Theorem. Following Rogers (1942b 
and 1951b) we now prove the following theorem of Hlawka. 

Theorem I. Let f{x) be a Riemann-integrable function of the variables 
x = {xi, x„) which vanishes outside a bounded set. Let A^>0 and s>0 
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be given. Then there is a lattice M of determinant such that 



4 2 f[a)<jt{x)dx+s, 

aeM 
+ 0 

where 

dx =dx^...dx„. 



( 1 ) 



We may suppose that f{x) vanishes outside the cube 

max I Xy I ^ S (1 ^ ^ m) . (2) 

Let be a prime number and let rj>0 be determined by the equation 

PrT = A,. (3) 



We may choose p so large that 

pt]>S. (4) 

Let A be the lattice of points 







(5) 


where Mj , . , 


, . , are integers, so 




Now 


d{N)=rf. 


(6) 




t]” 2 f{a) < f f(x)dx + ^s 


(7) 



o€ A 

0^0 



if r] is small enough, by the definition of Riemann integration; and 
so (7) is true when p is large enough, by (3). 

A point a of A other than o for which / (a) 4= 0 hes in (2) ; and 
so cannot be of the shape ph, be A by (4). Hence we may apply 
Lemma 1 where eq , . . . , Oj; are all the points a of A other than o at 
which /(a) 4=0 and 

*, = /(«.)■ 

Then M has determinant 

=pd{A) =prj” = A^, (8) 

and 

o€M ^ a€A 

4=0 a4o 

Finally, (1) follows from (3), (7) and (9), when p is chosen large enough. 
As in § 1 we have the 

Corollary. Let be a set with Jordan-volume V(6^) and let 
Ai>V{S^). Then there is a lattice M with i(M) =A^ which is admissible 
for 
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For let f(x) be the characteristic function of and choose b so 
that Zli> F(.5^) +£. The number of points of M other than o in ^ is 
then 

06 M 
a^o 

by (t). Since the number is an integer, it must be 0 . 

VI. 3.2. The result corresponding to Theorem I in which only primi- 
tive points are summed over is ; 

Theorem II. Let f{x), and e be as in the enunciation of Theorem I. 
Then there exists a lattice M of determinant <f(M) =di such that 

tf{n) zli 2*/(«)<//(®) dx + e, 

06 M 

where the star {*) indicates that only primitive points are to be summed 
over. 

We only indicate briefly the modification required to the proof of 
Theorem I. In any case Theorem II is embraced in the generalization 
of Theorem I to point sets A other than lattices due to Macbeath and 
Rogers (1955 a), which was discussed in § 1 . The exposition still follows 
Rogers (1947 b and 1951 b). 

In the first place, it is trivial that a point of M in the cube ( 2 ) of 
§ 3.1 is a primitive point of M if and only if it is primitive as a point 
of A. Hence it is enough to show that 

lim»y" 2 */(a) ={C («)}■!// (05) (fa;. ( 1 ) 

’I—® o6A 

Now 

Z /(«) =Z Z* /(»'«)• 

a^A f=lo6A 

04=0 

Hence by Mobius’ inversion formula [e.g. Hardy and Wright ( 1938 a) 
Chapter XVI], we have 

Z*/(«) =Za(»') Z /(>'«)• 

A r o 6A 

a^o 

Hence 

»?"2*/(a) = 2^<^(''»'/). 

n^A r^l 

where, for any |> 0 , we have put 

o{^) =f” Z /(.'«)• 

u integral 

1140 
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But now a(i) is bounded for edl i, and 

limud) = / f{x)dx. 

The result now follows on letting p-^ oo, so rj->Q, since 

2 -^ “{«(»«-■ 

As in § 1 we have the 

Corollary (The “Minkowski-Hlawka Theorem”). Let be a bounded 
symmetric star-body with volume and let 2ll{n) A{>V[S^). Then 

there is a lattice M with <f(M)=/li which is admissible for 6^. 

VI. 4. Schmidt’s theorems. We are now in a position to illustrate 
Schmidt’s method of improving the corollaries to the last two theorems. 
We first give a simple example 

Lemma 3 . Let y be a symmetric star-body in n-dimensions with 
Jordan-volume V{y) and let be any number such that 

3 C(«) 4 > (i+2^-”]V(y). 

Then there is a y -admissible lattice M of determinant 
Let g{x) be the characteristic function of y, and let 
f{x) =g(a;) -f 2 g( 2 ar), 

so that 

'3 if xi^y 

f{x) =1 if x&y, xi^y 
. 0 otherwise 

and 

ff(x)dx = (i+2^-’‘) V(y). 

Choose e so small that 

3C(n)4>(l+2^-’‘){F(5^)-he}. 

By Theorem II with Aij2 for Ai and this e, there is a lattice A with 
determinant 

rf(A)=i4 

such that 

2 /(o)<6. 

<*€A, primitive 

Since f{—x)=f{x), by the symmetry of y, there is thus no primitive 
point of A for which f[a)=}, and so no point of A at all in except o. 
Further, there are at most two pdrs of primitive points say ± 0 ^, ± 0 ^ 
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of A in if. By Lemma i Corollary 1 , there is a lattice M of index 2 
which contains neither Oj nor Cj. Since Oj, are not in the points 
20i, 2O2 of M are not in y. Hence M is y-admissible. Since 

i(M) = 2d{f\) = zli, 

the lattice M does what is required. 

VI. 4.2. When n = 2, the result of Lemma 3 is no stronger than 
Theorem II Corollary. 

By further elaboration, Schmidt (1956a) improved Lemma 3 some- 
what but for values of n at all large Lemma 3 is weaker than the follow- 
ing Theorem III which applies to all Jordan-measurable bounded sets 
not merely symmetric star-bodies. To obtain results about symmetric 
sets, Theorem III should not be apphed to y directly but, say, to the 
“half-set” of points 

a; f y , Xj ^ 0 . 

Then 

and a lattice M is .^-admissible if and only if it is y-admissible. There 
is thus an additional factor 2 for symmetric sets. 

Theorem III. Let y be any hounded n-dimensional Jordan-measurable 
set of volume V{^\ and let be any number such that 

(1 +2i-")(l -P3i-'')F(y)<2Zli. (1) 

Then there is a lattice M of determinant A^ having no points, except pos- 
sibly o, in y. 

Let g{x) be the characteristic function of S and put 

/(a?) =g{x) -f2g(2x) A-figi'ix) +6g{6x). (2) 

Then 

I f(x) dx = (i + 2-2-’' + } ■ y” -\- 6 ■ 6-”) J g{x)dx 
= (1-f 21-") (1 + 31-") F(y). 

By Theorem I there is thus a lattice A of determinant 

i(A)= 4/6, (3) 

such that 

2 /(a) <12. (4) 

a e A 

a#: o 

We shall construct a lattice M of index 6 in A with the required properties. 

We classify the points a of A in y, other than o, into four types 
Ti , Tj . and Tj : 
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(i) a is in if it is not of either the shape a = 26 or a = 36 with 
6 €A. 

(ii) a is in if it is of the shape a = 2 b but not of the shape a=}b, 
with 6 6 A. 

(iii) a is in if it is of the shape a—}b but not of the shape a = 26, 
6 eA. 

(iv) a is in Ig if it is of the shape a = 66 , 6 tA. 

Let , fVj, Nj, Ng be the numbers of lattice points in the correspond- 
ing classes. Then by ( 2 ) and (4) we have 

Ai + 3A;+4A3+'l2iVg<12, (5) 

since, for example, the contribution to ( 4 ) of a€lg is 

1+2 + 3 + 6 = 12 . 

In particular, by (5), 

Afg = 0. 

Suppose, first, that N^> 0 . We apply Lemma 2 with p= 2 , taking 
Og to be one of the points in and a, , . . . , to be the remaining 
points in I 3 (if any) together with any points in Ij. The numbers k, 
of the lemma are taken as 1 if o+Si and 4 if ra+Tg- Then, by Lemma 2 , 
there is a lattice f of index 2 which contains N{, N 3 points of Ij, I 3 
respectively, where 

A^i' + 4N3'^i(iVi + 4^3-4) ( 6 ) 

(the —4 being the contribution of Og, which is definitely lost). All the 
points of %2 of course, in f. By ( 5 ) and ( 6 ) we have 

2iVi' +3A^2 + 8fV3' +4^11. 

Hence iVg' =0 and A/i' + A/ 2 =|. so Afi' + Afj^ 3 . But now by Lemma 1 , 
Corollary 1 there is a sublattice M of P of index 3 which contains 
none of these A^i'+A ^2 points. Then M does what is required. 

We may thus suppose now that 

iV 3 = 0 . 

We now apply Lemma 1, Corollary 2 with ^ = 3 to the points a, with 

k, = \ if a+Ij and ^, = 3 if a+Xj- Since there are at most 

1K(3^-1)/(3-'l) 

points a,, we may take m=2, so, in the notation of the corollary, 

pm-l_ 1 ^ 1 

p”<- 1 ~ T' 
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Hence there is a sublattice f of index 3 which contains points 

of Xj , X2 respectively, where 

^ I (N, + 

Hence = 0 and ^ 2. By Lemma 1 , Corollary 1 , there is a sub- 

lattice M of r of index 2 which contains none of these points. This 
lattice M does what is required. 

Thus in every case we have constructed a lattice M of index 6 in A 
which is admissible for Since 

(f(M) =6d{A) = /Ji. 

the lattice M has all the required properties. 

As Schmidt remarks, Theorem IH can be improved somewhat at 
the expense of further elaboration; but for large n is weaker than 
Rogers’ results which we referred to in § 1 and which we cannot prove 
here. In particular the factor (1 -f 2^“’’) (1+3^“”) on the left of (1) may 
be replaced by something smaller if ,5^ is a star-body, since then a point 
in is automatically in if f-^r. 

VI. 5. A conjecture of Rogers. We digress now from the general 
theme of the chapter to prove a result which was conjectured by Rogers 
(1951a), who compares it with the generalization of Theorem II of 
Chapter III from w = 1 to w>1. It was proved by Rogers when the 
number m occurring in it is a prime and by Schmidt (1955a) for all 
except a finite number^ of m. It has been proved generally in a rather 
wider context by the author (Cassels 1958a). We do not use it later. 

Theorem IV. Let hy a symmetric star-body and let A be a lattice 
with 

md{A)<A{:^), (1) 

where m^i is an integer. Then SL’ contains at least m pairs rh® 0/ points 
of A other than o. 

Theorem IV is an immediate consequence of the following theorem 
in which the reference to star-bodies disappears. 

Theorem V. Let a^, be primitive points of a lattice A and let 

ir ( 2 ) 

be positive integers. Then there is a lattice of index at most 

hH h/j? + 1=/ + l (say) (3) 

^ For all 10’ and all sufficiently large m, according to the review in Mathe- 
matical Reviews! 
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which contains none of the points 

±i,a, \^r^R). (4) 

We show first that Theorem V imphes Theorem IV. Suppose that 
A in Theorem IV contains fewer than m pairs of points of 5^. Since 6^ 
is a star-body, the points of A in can be put in the shape (4), where 
the number of pairs is 

J <m. 

Hence by Theorem V there is a lattice M of index ^ »n in A which con- 
tains none of these points, i.e. M is .^-admissible. Since 

d{M)^md{h)<A{Sr), 

by (1), this is a contradiction to the definition of A{£^). 

The proof of Theorem V depends on the following lemma, which 
gives the existence of primes with certain properties. It is due to 
Sylvester (1892 a) and was rediscovered by Schur (1929 a) who 
gave a rather simpler proof. The proof is in any case rather involved, 
so we do not give it here but refer the reader to the original papers. 

Lemma 4 (Sylvester). Let X, Y be integers and 

Y. 

Then there is a prime number p>X which divides one of the numbers 

Y+\,...,Y + X. 

We now prove Theorem V. Suppose first that i? = 1 . Since is 
primitive, it may be taken as part of a basis for A : 

®i — hi » hj , . . . , , 

where n is the dimension. Clearly the lattice M of points 

«i 6i H T- , 

where are integers and 



Wi = 0 (7 + t). 



does all that is required. 

We now consider the case when I? >1 and use induction on /. Without 
loss of generality 
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Let p be the prime given by Sylvester’s Lemma 4 with 



Then 



X = min{j\,ii + ■■■ +/;e), 
Y =max(/i,/2 + ••• -f 



p>X^i, (2^r^R). 



( 6 ) 



Since p divides one of the numbers Y+1, Y + X, we have 



that is 



X 

J 



+ 



y; 

J 



X+ Y 
~P 



+ 



■•• + iR 



< 



7i+ +1R 



(7) 



where for any real number ;t we denote by [x] in this proof the integer 
such that [x]^x<[x]-\-\. By Lemma 2, there is a lattice f of index p 
which does not contain Oj and such that 



that is 



ir\ 

«r€r 






/i+ + 1R 

P 



( 8 ) 



By (6), if a point in (4) with r>l is in f, then a, is in f. Since 
Oj is not in f, the only points (4) with r = i in f are the 



±i[{pa,) 



(9) 



But now, by the hypothesis of the induction argument, there is a lattice 
M of index at most 



1 + 



h 

P 



+ + 

o,er 



h 

P 



+ 



?8+ ••• +1R 

p 



in r which contains none of the points (4) at all. The index of M in A 
is p times the index of M in f; and so, by (7), is 






This concludes the proof of Theorem V. 

VI. 6. Unbounded star -bodies. The results of §§ 3, 4 extend to un- 
bounded star-bodies. For example we have 

Theorem VI. Let ^ be a hounded or unbounded symmetric star-body. 
Then 



A{y)^{2i:(n)Y^v{s^). 
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When y is bounded this is just Theorem II, Corollary. When 9 " 
is unbounded it follows from Theorem II, Corollary together with Theo- 
rem V, Corollary of Chapter V. 

In the same way any of the other estimates of §§ 3, 4 may be extended 
to unbounded star-bodies , or indeed, to any open sets of finite volume 
of which the origin is an inner point. 

VI. 6.2. There certainly exist star-bodies if of finite type [i.e. with 
A{S^)<oo] and infinite volume. A 2-dimensional example is 

\xiX2\ <i\ , ( 1 ) 

for which d(. 5 ^j) = 5 ^ as we saw in Chapter II. More generally, in 
n-dimensions the body 

<"1 

is of finite type but infinite volume, since admissible lattices are given 
by the norm-forms of totally real algebraic fields of degree n (see 
Chapter X). In general, in more than 2 dimensions it is very difficult 
to decide whether a given star-body is of finite type or not. Two 3- 
dimensional examples are discussed in Cassels and Swinnerton-Dyer 
(1955 a), for which a decision on this point would have interesting reper- 
cussions. In 2 dimensions however there do exist general criteria which 
we shall now discuss. 

VI. 6. 3. From now on we put ^ 

n — 2 . 

In an obvious sense, the body . 9 ^ defined in (1) of § 6.2 has two pairs 
of asymptotic arms, the asymptotes being the % and %2 axis. It is 
possible to inscribe in if arbitrarily narrow parallelograms with one 
pair of sides parallel to an asymptote and area 1, for example 

In a sense is a limiting case, since if it is possible to inscribe in a 
star-body y parallelograms with centre the origin and arbitrarily large 
volume (area), then y is of infinite type by Minkowski’s convex body 
Theorem II of Chapter III. Roughly speaking, any star-body with a 
pair of arms wider than those of is of infinite type. We now show 
that a 2-dimensional star-body may have any finite number of arms 
like those of if^ and still remain of finite type. 

^ It is customary to call 2-dimensional star-bodies “star domains” but we do 
not follow this usage. Similarly we may sometimes continue to speak of volume 
where area is more usual. 
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Theorem VII. Let 
for some e> 0, and let 

fj(Xl.X2) 

be any finite number of indefinite quadratic forms. Suppose that the 



distance-function F{xy, Aj) satisfies 




F2(xi,X2)^^mm |4(xi,X2)1 


(2) 


for all (xj, X 2 ]. Then the star-body 




6T-. F{xi,x.f)<\ 


(3) 



is of finite type. 



The exponent 2 in (2) is dictated by reasons of homogeneity. 

We shall deduce Theorem VII from the following generalization of 
a theorem of Marshall Hall ( 1947 a) which is due to the author 
(Cassels 1956 a). 

Theorem VIII. Let , ..., be any real numbers. Then there exists 
a real number a such that 

\»\\(<x+^k)^ + ^\ > (4) 

for all integers « =)= 0 and v. 

We first deduce Theorem VII from Theorem VIII, and then prove 
Theorem VIII in § 6.4. After a suitable rotation of the co-ordinate 
system, we may suppose without loss of generality that 

4(1,0) + 0 (ig/^7); 

and so 

4 (% - Xi) = 4 (% + X 2 ) (Xi -f- 9 ?,. X 2 ) (5) 

for real numbers A, , , cp, such that 

Ay 4=0, i?y4=9’;- 

But now 

|4(Xi,Xj)| ^/.«ymin{|xj(Xi-f dyAJ^)], \x2[x^ + (pjX^)^, (6) 

where 

/ry = i|Ay||t?y-9?y| >0) 

since if, for example 
then we have 

K^y - cpf) X 2 I = |(;ri -b &jX^} - (Afi -b 9?y X2)\ ^ 2 l^i -b 9>y . 




192 



The theorem of Minkowski-Hlawka 



We apply Theorem VIII where the jffj, are the (pj in some 

order, so K = 2 J. Let a be the number given by Theorem VIII, so that 

|«{(a + i?,)M + v}|^?7>0 I 

!“{(« + 9’;) “ + J 

or integers u=^ 0 ,v, where 

i?={8(2/ + l)^}-b 

Let A be the lattice of points 

ix^.Xi) =R{a.u + v,u), ( 8 ) 

where u, v run through all integer values and i? is a positive number 
yet to be chosen. If « 4=0 we have, by (6) and ( 7 ) 

\ij{R[a.u-\-v).Ru]\^(x^R^ri. ( 9 ) 

If however « =0 but t;=t=0 then, by ( 5 ), 

|4(i?t;,0)|^|Ay|i?i*. (to) 

Similarly 

to[xi,x^) = s{x\ + x\)^eR^ ( 11 ) 

for all («i, A;j)tA other than o, on distinguishing the two cases «=t=0 
and M = 0, v=t=0 in (8). We may choose R so large that the right-hand 
sides of (9), ( 10 ) and (11) are all not less than 1. Then for all (x^, a;2) sA 
except o, we have, by (2), 

Xi) I fj (% , X 2 ) I ^ 1 ; 

that is A is ^-admissible. This concludes the proof of Theorem VII. 

VI. 6.4. We now prove Theorem VIII which was enunciated in 
§ 6.3. Write 

x^{2{K + \))K ( 1 ) 

We shall construct a sequence of open intervals -^_i, *^i< which 
enjoy the following three properties: 

(i) m •4.+1 is contained in 

(ii) „ J„ is of length 

(iii) „ the inequality 

m|(* + /?*)“ + «^| > (2) 

holds for all numbers a in and for all integers v and u with 



OKu-^x”. 



( 3 ) 
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If we can construct the we shall have proved Theorem VIII, 

since there is a number a contained in all the intervals and then 

(2) holds with this a for all integers «>0 and v. 

We may take to be the interval 0<a<l, since there are no 

integers u in (3) with m = — \. We thus assume that has already 

been constructed and construct By (ii)„, the open interval 

is the set of a satisfying , „ 

a<a<a (4) 

for some numbers a' and a" for which 



a." -oi' = (5) 

For each k {\^k^K), there is at most one fraction vjui^ in its lowest 
terms such that 

( 6 ) 

since two fractions vju with differ by at least 

By (iii)„, we have 

u,> x”' {i^k^K). (7) 



Let 'S be the set of a such that 

a'-f (8) 

and 

«*!(«+/?*) + (9) 

for all k in for which a vju^ of the type (6) exists. Then ^ 

consists of at most K -\-\ intervals. Their total length is 

k 

t’y (I)> (5) and (7). We may therefore find in ^ an open interval 
of length exactly Then satisfies (i)„ and (ii)„+i, by 

construction. It remains only to verify (iii)„.^i. We may clearly suppose 
that u and v are coprime and that 

x”<u-^xr+^ ( 10 ) 



by (i)„ and (iii)„. If v/m=v*/m* is a fraction of the type (6), then 

u\{a+P^)u + v\>\x-* ( 11 ) 



for all by (9)- Otherwise — + /3J is not in and so 



V 

u 



+ (“ + 



> 



l^-2«-4 

2 



for all by (8); then (11) follows, by (10). Thus has all 

the required properties. 
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Chapter VII 

The quotient space 

VII. 1. Introduction. Before resuming the general study of the geo- 
metry of numbers, it is convenient to introduce here the concept of the 
quotient space of an ^-dimensional space by a lattice. This concept 
plays an important role in the discussion of inhomogeneous problems in 
Chapter XI ; but we shall also need it in Chapter VIII as it gives the 
most natural interpretation of Minkowski’s theorem about the succes- 
sive minima of a convex body with respect to a lattice. 

In § 2 we give the definition and most important properties of a 
quotient space. In § 3 we prove a result which will be basic for one 
topic in Chapter XI. 

VII. 2. General properties. Let A be a lattice in ^-dimensional eu- 
clidean space. Two points of the space are said to be congruent 

modulo A, written 

(A), (t) 

if the difference y^—y^ is in A. This relationship is clearly symmetrical 
in i/i and y^. If 

(A). Vi = yz (A), 

then 

= (A). 

The points y may therefore be divided into classes ^ so that two points 
y and y' are congruent if and only if they are in the same class. A class t) 
consists of all the points yo+a, where y^ is some fixed member of p 
and a runs through all points of A. 

If 

y’^y (A), (A), 

then clearly 

y'J^z' = y + Z (A) . 

Hence there is no ambiguity in defining the sum ^ of two classes 
as the class to which y-\-z belongs when y, z are any members of t), j 
respectively. 

Similarly, if t is an integer, the definition of as the class to which 
ty belongs when ^ is in p is unambiguous. On the other hand, if t is 
not an integer, it is not, in general, true that ty'=ty when y'=y. 
Hence tl) for real numbers t other than integers must be left undefined. 

So far, of course, we have only followed the standard procedure for 
finding the quotient group of an abelian group (namely the additive 
group of all vectors) by a subgroup (namely the additive group of vectors 
in A). We shall say that the classes Q are points of the quotient space 
where ^ will denote the original «-dimensional euclidean space. 
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VII. 2.2. Let F{x) be any distance function defined in ^ and put^ 

L'(t)) =inff(t/) ( 1 ) 

»ei) 

for 1 ) 6 ^/A. This is the function which will be important in inhomo- 
geneous problems (Chapter XI). Note that 

F(o)=0, (2) 

where o is the class to which o belongs. For reference we enunciate the 
principal properties of F(j), je^/A, in the following lemma. 

Lemma 1. Let F(x) be a distance function and let F{i) be defined, as 
above, for je^/A. Then 

(i) F[ti)^tF{i) for integers t^O. 

(ii) If F(x) is convex, then so is F{g), in the sense that 

F(i + t))^F(i)+F(t)) 

for all j, 

( Hi ) If F[x) =0 only for x—0, then F (g) — 0 only for g = o. Further, 
for each ^g^/A there is a such that F[t)) =F{y). 

(iv) If Fj(x), F 2 (aj) are two distance function and F^[x)'^c F^(x) for 
some number c and all xe^, then F^[i)-^cF.^{g) for all jg^/A. 

Here (iv) is an immediate consequence of the definition ( 1 ). By the 
definition of a distance function, we haveF(/a?) =tF[x) for all real t>0. 
Hence, if />0 is -an integer, we have 

F[ti) = inf F(y) g infF(tx) =/infF(a;) =tF{i). 
yiti aej 

This establishes (i). The proof of (ii) is similar and may be left to the 
reader. 

It remains to prove (iii). Let t)g^/A and let so that the general 
element of t} is yo+“> <*^A. By Lemma 2 of Chapter IV, there is a 
constant c>0 such that F(a;)^c|a;| for aU x\ and so 

F(t/o + «)^cli/o + a| ^c|lal 

In particular, if F(a +j/j)gF(j/j), we have 

|al^|i/o| +c-iF(j/o). (3) 

There are only a finite number of agA in ( 3 ). Hence there exists an 
OogAsuch thatF(i/o+ao) = infF(j/o + a).Bydefinition,F(t))=F(yo+Oo). 

oeA 

Further, F(t)) =0 only if F(i/Q-l-ao) = 0 , that is t)= 0 . 

1 There should be no confusion with the usage of Chapter IV, since there the 
arguments were lattices; and here they are classes with respect to a lattice. 

13* 
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VII. 2.3. Let t), (1^r<oo) be a sequence of elements of ^/A. We 
say that the sequence tends to if 

,lim =0, (1) 

f — ► 00 

where, in conformity with the notation of § 2.2, we have written 

lj|=inf|®l. (2) 

®€E 

Lemma 2. A necessary and sufficient condition that is that 

there exist elements y'^^' 

Vr-^y'- (3) 

Suppose, first, that the y,, y' exist such that (3) holds. Then 

\%-^'\^\yr-y'V. 

so (1) holds, that is 

Suppose, now, that (1) holds. By Lemma 1 (iv) there exist t)' 

such that 

l«r| =hr-^'|- 

Let y' be any element of and put y,=y -{-<^r- Then the y, clearly 
have all the properties required. 

VII.2.4. Let 

bi, •••,&„ (1) 

be any basis for A. Then every point x of space can be put uniquely 
in the shape 

®=fibi+---+l„b„ (2) 

for some real numbers li, ..., and a;eA if and only if ..., are 
integers. Hence to every vector x there is a unique aeA such that 



y =aj-a=j?i6iH (3) 

where 

0^rjj<l. (4) 

In other words, every je^/A has precisely one representative i/ej in 
the half-open parallelopiped ^ defined by (3) and f4). We say that 
this parallelopiped is a fundamental parallelopiped for A. Different 
bases 5,- in general give rise to different fundamental parallelepipeds. 

An immediate consequence of Lemma 2 and the existence of a 
fundamental parallelopiped is 
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Lemma 3- The quotient space ^/A is compact. That is, any sequence 
(1^r<oo) of elements of ^/A contains a convergent subsequence: 

(5) 

The fundamental parallelopiped 0^ is not compact, since although 
it is bounded it is not closed. Let 0 be its closure, that is the set of 
points (3) with (1^/^w). Let be the representative of 

in 0. By Weierstrass’s compactness theorem (§ I .3 of Chapter III), 
there is a convergent subsequence 

where y'^0. Then (5) holds by Lemma 2, where y'gt)'. 

VII. 2.5. We are now in a position to introduce a measure into the 
quotient space ^/A. Let S be any set of elements of 0jA. We call a 
set 0 of elements of ^ a set of representatives for S if (i) for each 
jeS there is precisely one which belongs to 0 and (ii) each x£0 
belongs to an jgS. We say that S is measurable if at least one set 0 
of representatives is measurable. 

Let be the set of elements x€0 oi the shape 

x=y + u, y€0, ueA, 

where 0 is any measurable set of representatives of S and ^ is a funda- 
mental parallelopiped. By Theorem I Corollary of Chapter III, the set 
01 is measurable, and 

V(0,} = V(0). 

In particular, if 0, 0' are any two measurable sets of representatives 
of S, we have V{0) =V{0'). This common value will be denoted by 

m(S) 

and will be called the measure of S. 

Clearly the measure of the whole of the quotient space is the volume 
of the fundamental parallelopiped 0, that is d{A). 

Let T be any homogeneous mapping of w-dimensional space 0 onto 
itself. In a natural way, it gives a mapping of ^/A into 0\xA, which 
we may also denote by t. If m' is the measure defined in 0(xA in the 
way that m is defined in 0jA, then clearly 

m'(TS) = ldet(T)| m(S) 

for any set S in ^/A. 
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VII. 3. The sum theorem^. If C and D are two sets of points in 
the quotient space 3t\K we denote by C + D the set of all points 

c + b , where c 6 C, b g D . 

This section is devoted to proving 

Theorem I. Let C and D he non-empty sets in dljA with measures 
m(C) and m{D) respectively. 

(i) If m{C) +m(D) >d{A), then C + D is the whole space SijA. 

(ii) If m(C) +m(D)^(f(A), then m {C D) ^ m {C) + m(D). 

This theorem is due to Macbeath (1953a). It was discovered inde- 
pendently by Kneser (1955a), who first recognized its importance for 
the geometry of numbers. Theorem I is, in fact, now only part of a 
much wider theory, for which see Kneser (1956a) and the Uterature 
cited there. It falls into the same circle of ideas as the so-called “a -f/S 
hypothesis” about the densities of sequences of integers which was first 
proved by Mann. As all this is rather aside from the main theme of 
the book we do not discuss it further. It is convenient to prove Theo- 
rem I here but the application to the geometry of numbers will not be 
made until Chapter XI. 

Part (i) of Theorem I is easy. Suppose that there is a point f of 
^/A which does not belong to C -f D. Then none of the points 

5 -c, ceC (1) 

can belong to D. We may denote the set (1) by j — C. Clearly 

m(E-C)=m(C). (2) 

But D and E — C have no points in common, so 

m(E— C) -f m(D)^m(^/A) =i(A). (3) 

Then m{C) +m{D)^d{A), by (2) and (3). This proves (i). 

In what follows we denote, as is conventional, by C D and C^D 
the sets of points which belong to both C and D and to either C or D 
(or both) respectively. We note for further reference the identity 

m(C^D) m(C^D) = m(C) -j- m(D); (4) 

which becomes clear on noting that points of C D occur in two sets 
on each side of (4), but points of CwD other than those of C'^D occur 

* The results of § 3 will not be needed until Chapter XI. 
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in precisely one set on each side. Further, we show that 

C + D>(C^D) + (CwD) (5) 

( ) means "contains”). For let 

agCr^D, beCwD. 

Suppose b belongs to C : then we may regard a as belonging to D since 
it belongs to both C and D. Hence a+b = b+ Q€C+D. Similarly, if b 
belongs to D we regard o as belonging to C. 

It follows from (4) and (5) that, if the conclusions of Theorem I are 
true when C^D, C^D are read for C, D respectively, then the con- 
clusions are also true for C and D themselves. This is one of the principal 
ingredients of the proof. The other is provided by 

Lemma 4. There is some such that 

d(A) m((C -f j)^D} = m(C) m(D). 

Before proving Lemma 4 we complete the proof of Theorem I with 
its use. Let C, D be two sets with 

m{C)==yd[A), m(D)=dd(A) 

and 

y + (5 ^ t . 

If y=0, the conclusions of the theorem certainly hold, since C is non- 
empty, by hypothesis, and if ceC the set c+D, which is contained in 
C + D, has measure m (D) = m ( C) + m (D) . We may thus suppose without 
loss of generality that 

0<y^S, y + <5^1. (6) 

Now let j be given by Lemma 4, and put 

Ci = (C + j)^D, Dj = {(C -f f) w D} — j. 

Write 

m(Q=y^d(A). m(D,) =did(A), 

so that 

yi + ^i = y + <5, 

and 

yi = yd 

by (4) apphed to C-|-j and D and by Lemma 4 respectively. Further, 

C + D ^ Cj + Dj , 
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by (5) applied to C + J and D. We may now repeat the process on 

Cj, Dj. In this way we get a sequence of sets C,, with measures 

y,d[N), d,d{l\) respectively, such that 

C + D^C, + D,., (7) 

and 

Yr + \=y + ^> (8) 

Yr=Yr-l^r-l- (9) 

But now, by the argument used when y = 0, it is certainly true that 

m{C, + D,)^m{D,)=d,d{A). (10) 

It follows from (6), (8) with r— 1 for r and (9), that 

y,^y,_i(1 -Yr-i)> 

and so 

7,-^0 (r->oo). (11) 

Hence 

(5,.->y + (5 (r^oo), (12) 

by (8). But 

m(C + D)^,5,<i(A), (13) 

by (7) and (10). In letting r^oo in (13) and using (12) we have 
m(C + D)^(y + (3)(f(A) =m(C) +m(D) 

as required. 

It remains only to prove Lemma 3. We note, first, that 

”’{(C + f)^D} (14) 

varies continuously with j. This is clearly true with the “well-behaved" 
sets C and D to which we will wish to apply Theorem I, but it is in fact 
true for all measurable C and D, see for example A. Weil (1951a). 
In the second place, in an appropriate sense, to be explained more fully 
below, the average of (14) as j runs through MfA is w(C) m(P)jd{A). 
Perhaps the simplest way is to observe that we may introduce integration 
in ^/A in the obvious way. Let (p (j) be a function defined in 3 ijA and 
let f[x) be the function in ^ such that 

t{x) =<p[i), 

when X belongs to the class j. Then we write 
J<p{p)dp=Jf{x)dx, 

«/A ^ 

where ^ is a fundamental parallelepiped. Exactly as in § 2.5, one may 
show that this definition is independent of the choice of fundamental 
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parallelopiped Let 95(5), xii) be the characteristic functions of C, D 
respectively; so that 



m {(C + e) D} = / 95 (p + E) (t)) if Q . 

«/A 

Then 

/m{(C + E)'^D}ifE = /{ / 95 ( 9 +?)%(!)) 

311 A 311 A '■311 A ‘ 

But 

/?’(b + E);tW'^E = %(9)m(C). 

31IA 



(15) 



Hence, on interchanging the order of integration in (15), we obtain 



J m{(C + Tc) r. D} = m{C) f x(^) dt) = m{C) m (D) . 

31IA 31IA 



Since ^/A has measure 

m{^IA)=fidi=d{A), 

«/A 

the truth of Lemma 4 now follows from the continuity of m{(C +e) r«D} 
and the connectedness of ^/A. 



Chapter VIII 

Successive minima 

VIII. 1. Introduction. For some purposes one requires to know not 
merely that a lattice A has a point in a set but that it has a number 
of linearly independent points in 

Let F{x) be an w-dimensional distance function and A a lattice. If 
for some integer k in i^k^n and some number A the star-body 

Xy: F{x)<X (1) 

contains k linearly independent points 

®i > • • • > (2) 

of A, then so does fiy for any fi>X, since the points (2) are also in 
/j. y. We define the A-th successive minimum A* = A* (F, A) of the dis- 
tance function F with respect to the lattice^ A to be the lower bound 
of the numbers A such that \y contains k hnearly independent lattice 
points. Clearly 

Ai^A,^--.^A„. ( 3 ) 



^ Or of the lattice with respect to the distance function. 
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The numbers defined above certainly exist, since if 

Cj , . . . , a„ are any n linearly independent points of A, then, trivially, 

Xk^X„< maxf(a,). 
isysn ’ 

In the notation of § 4 of Chapter IV we have 



X^ = F{f\) = inf F(a). 

aCA 

4=0 

Hence, by the definition of 

F”(A) 

we have 

X1^6{F)d{^). 

The remarkable inequality 



(4) 

(5) 

( 6 ) 
(7) 



was discovered independently by Rogers (1949a) and Chabauty 
(1949a); and Chabauty (1949a) and Mahler (1949a) independently 
produced examples to show that if x is any number then there 

are distance-functions F and lattices A such that 



xd{F) d{t\] . (8) 

We shall give the elegant proof of (7) in § 3 and give the construction 
of the counter-example to show that it cannot be improved in the case 
M = 2. The difficulties in extending the counter-example to n dimensions 
are purely algebraic. It can be shown easily by means of an example 
that 

(5(F) d (A) 

can be arbitrarily small, so there is no lower bound analogous to the 
upper bound (7) [but see (13) below for symmetric convex F], 

The inequahty (7) holds with a suitable definition of the terms not 
merely to star-bodies F{x)<i but to all point sets 6^ whatsoever. 
There have been several different definitions of the successive minima 
of an arbitrary set y. We do not discuss these further, but refer the 
reader to the papers quoted for the extensive hterature. 

It was shown already by Minkowski (1896a, § 51) that, when F{x) 
is the euclidean distance |a;|, the inequality (7) may be replaced by 

X^...X„^d{F)d(A). (9) 

We give his proof in § 2. More generally, it has been conjectured that 
(9) holds for all symmetric convex distance functions. In § 4 we shall 
show for these F that 

Xr^X„^d{F)d{A); ( 10 ) 




Introduction 



203 



which is equivalent to (9) when n=2. The inequality (10) was ap- 
parently discovered by Chalk and Rogers (1949 a) and Chabauty 
(1949a) independently. It has been shown by Woods (1956a and 
1958b) that (9) continues to hold for «=3 when F is symmetric and 
convex and for n=2 when F is convex but not symmetric : the proof 
is distinctly intricate and we do not discuss it here. For general n 
and symmetric convex F, Rankin (1953^) indicates that the constant 
2i(»-i) (-an be replaced by a rather smaller one. 

For S5Tnmetric convex functions F and any n, there is a result going 
back to Minkowski (1907a) which may be regarded as a substitute 
for the unproved conjecture that (9) holds. In our notation, Min- 
kowski’s convex body Theorem II of Chapter III states that 

X\Vp^2”d{^), (11) 

where Vp is the volume of F(aj)<l ; and so XiVp is the volume of the 
body Fix)<Xi, which, by hypothesis, contains no point of A except o. 
Minkowski’s theorem is that in fact 

X,...X,Vp^2’'d{A). (12) 

The proof of (12) remains difficult. Simpler proofs than the original 
have been given by Davenport ( 1939 c) and Weyl (1942a). We follow 
Weyl in § 4, since the ideas introduced will be needed in Chapter XL 
For symmetric convex F there is also an inequality 

X,...X„Vp^^d{A), (13) 

the almost trivial proof of which is also given in §4. From (12) and 
(13) it follows that the product ... A„ is determined by Vp and <f(A), 
except for a factor which is bounded in terms of n. 

In general, it is hopeless to expect more information about successive 

minima than can be deduced from the formulae for the product . 

For example, let Xi, .... X„ be any numbers such that 

^ A2 ^ ^ ; Xi-.. X„ = i . 

Then the lattice A of points 

(AiMj, 72«2 , ..., A„m„) («q,...,M„, integers) 

has (A) = 1 and has successive minima Aj , . . . , with respect to the 
distance function 

F(x) = max lx, I , 



as is easily verified. 




204 



Successive minima 



VIII. 1.2. For later purposes we shall often need the following two 
simple lemmas. 

Lemma 1 . Let he the successive minima of a lattice A with 

respect to a distance function F associated with a bounded star-body 
F{x) <1. Then there exist n Linearly independent points a^, 
such that 

F{a^)=X^ 

If agA and F{a)<Xj, then a is linearly dependent on a^, a^_i. 

For by the definition of X„ there are n linearly independent points 
of A in 

+ F (f) 

By Lemma 2 of Chapter IV, the set (1) is bounded and so contains only 
a finite number of lattice points. Only these points need be considered 
in the definition of the Xj. The truth of the lemma is now obvious. 

Lemma 2. Let X^, X„ be the successive minima of the distance 
function F with respect to the lattice A. Then there is a basis 

bi, ... ,b„ 

of A such that, for each j = \, 2, ... , n, the inequality 

F(x) < Xj 

implies that 

x=Uibi-\ |-M,-ib,-i 

for integers «i , . . . , «y_i . 

When F{x)=0 only for x = o, this is a trivial consequence of Lem- 
ma 1 , since we may choose b^, ...,b„ so that Oj for each j is dependent 
only on b^, ...,bj, by Theorem I of Chapter I. 

Otherwise a slightly more refined argument is needed. In general, 
the Xj will not be all unequal, but there are numbers 

lh<A2<---<As. 

for some s in 1 gs^«, such that 

where 

0 = k^<k^<-- - <k^ = n. 

By the definition of successive minima, there is no point of A with 
F(a)</j,i except, possibly o. Since 

1^2 > > 

* For a general distance function F(x) there is, of course, no reason why Aj 
should not be 0. Indeed, if F{x) = | jTj . . . we have Aj= ■ • • = A„= 0 for the 

lattice A, of points with integer coordinates. 
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the are linearly independent points 

«i.02. •••.«*. (2) 

of A in F(x)<fi^, and, since 

every other point of A in F{x)<fi 2 is linearly dependent on them. 
Similarly, we may find linearly independent points of A in F{x)<n^ 
such that every other point of A in F{x)<fi^ is linearly dependent on 
them. Since fi 2 </J '3 we may suppose that of these points are 
Oi, already determined. We may thus denote by 



®i 1 • • • > ®*, 



the maximal linearly independent set of points of A in F{x) without 
disturbing the notation (2). And so on. In this way we obtain k^-i<n 
points 



of A such that 



. <*2 > • • • > ®*,_i 






By Theorem I of Chapter I there is a basis bj, ..., 6„ of A such that, 
for each / = 1 , . . . , Aj_j , the vector is linearly dependent on bj , . . . , by 
only. This basis clearly has all the properties required. 

VIII. 2. Spheres. We first prove the results for spheres, since they 
are simplest and the treatment forms the model for what follows. 

Theorem I. Let 

-f^o(®)=i®| (i) 

and let , . . . , be the successive minima of a lattice A with respect to 
Fq. Then 

( 2 ) 



The left-hand side of (2) was substantially proved in Theorem XIII 
of Chapter V. We have on the one hand 



ldet(Oi,...,a„)| =Id{^)^^d{^), 

where I is the index of cq , . . . , a„ in A, and, on the other hand. 



ldet(Oi,...,a„)|^|ai|...|a„l 

by Hadamard’s Lemma 9 of Chapter V. If now the Oy are the hnearly 
independent vectors of A with F{af) = Ay given by Lemma 1 , the required 
inequality follows at once. 
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It remains to prove the second part of (2). As in the proof of Lemma 9 
of Chapter V, there is a set of mutually orthogonal* vectors Cj, c„ 
such that 



— ^/l^l H + ^ii^j 



for some real numbers [n^j], where bj is the basis given by Lemma 2. 
By incorporating a factor in c, we may suppose, without loss of generality, 



that 


|c,p = l 


Then 


Z^jbj = ZZ^,i,iCi'. 


and so 


1 * 7 * 










We now show that 





for all sets of integers u^o. For let Mj, ..., m„ be integers, and suppose 
that 

M;=bo, My = o (;>/)• (5) 

Then u^hy-\ *s **ot dependent on b^, .... bj_i', and so 

( 5 ') 



Further, (5) implies that all the summands in (3) and (4) with i> J 
are 0. Hence, and since if the left-hand side of (4) is 



•S/ 




^7^ 






by (3) and (S'). Hence if A' is the lattice with basis 



6' *CiH 

we have 



for every point ^Myby + o of A'; that is 



On the other hand. 
But now 



Fo(A')=|A'|^l. 



^(A') 



|A'|" 

d{A') 



^ IMI” 






( 6 ) 

( 7 ) 

( 8 ) 



* We say that two vectors a, b are orthogonal if their scalar product ab 

vanishes. 
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by the definition of The right-hand side of (2) follows now from 

(6), (7) and (8). This concludes the proof of Theorem I. 

VIII. 2.2. As was remarked in Chapter V, the theory of successive 
minima shows that the hypotheses of Theorem III and IV of Chapter V 
are equivalent. This we do now. 

Lemma 3- The following two statements A and B about a set 2 of 
n-dimensional lattices A are equivalent, where x, K, ZIq, are supposed 
to depend on 2 but not on A. 

(A) there exist Zl, < oo and x>0 such that d (A) ^ , and | A| ^ x > 0 

for all Ae2. 

(B) there exist Af,>0and K<oo such that d{A)^A,x>0 and the sphere 
\x\^K contains n linearly independent points of A, for all A 6 2. 

If Xi, are the successive minima of i^(ac) =|ac| with respect 

to A, then clearly (A) and (B) are equivalent to 

(A) d(A)^Ax, Ai^x>0, 

and 

(B) d{A)'^Ao>0, k„£K, 

respectively. We now use the inequality 

^(A)^Ai...^^,5(fo)^(A) (1) 

of Theorem I. Suppose first that (A) holds. Then 

d (A) ^ {,5 (F„)}-i A, . . . A„ ^ (F„)}-i X" = Zlo (say), 

and 

A„^(Ai...A„_j)-id(Fo)^(A)^;r-+M(F„)4 = F (say). 

These are the two conditions (B). 

Suppose now that (B) holds. Then 

(A„A„_i... A 2 )'^«f(A) ^ K-’'+^Aa = y. (say). 

and 

rf(A)^Ai...A„^F" = 4 (say). 

These are the two conditions (B). 

VIII. 3. General distance-functions. We first prove a lemma which 
will be required later. Just in this section we denote by {x} the fractional 
part of X, that is, the number such that 

0 ^ {%} < 1 , X — {x} = integer. 

Lemma 4. Let r]x,...,rj„ be any real numbers. Then there is a 
number rj such that 

2 {rjj~ri}^\(n - 1 ). 



( 1 ) 
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For any number | we have clearly 



Hence 






0 if {|} = o- 

1 otherwise 



^ 1 . 



ISAgn lgA<;g» 



Thus there is at least one k such that (1) holds with rj=rji^. 
We shall require only the more specialized 
Corollary. Let fi„ be any numbers such that 



( 2 ) 

Then there exists a real number /li> 0 and positive integers m-^, w„ 
such that 

(i) wiy+i/wy is an integer 
( ii) fi mj ^ fij (i^j^n), 

and 

( in) ifi mi)... (jj, m„) . 



We shall in fact take all the m^ to be powers of 2, say 



Let 



mj = (1 ^ ^ m ) . 

/ij = 2’>i (1 g / ^ n) 



(3) 

(4) 



for real numbers ?/,■; and let rj be the number given by Lemma 4. By 
subtracting an appropriate integer from rj we may suppose, by (2) and 
(4), that 



If now /j, = 2’’ and the integers Ij are defined by 

Vi-V=h+{Vj-V}’ 

then the numbers m^ defined by (3) clearly satisfy (i) and (ii). Further, 
by the lemma, 

JJ (J^'j = 

which is just (iii). 

VIII. 3.2. We are now in a position to prove 

Theorem II. Let F{x) be a distance-junction and Ai, . . . , its succes 
sive minima with respect to a lattice A. Then 



( 1 ) 
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We denote by bj b„ the basis for A given by Lemma 2. Let 

[I and the integers be given by Lemma 4, Corollary when = 
and let A' be the lattice with basis 



Then 



5' = {/i ntj) 1 by (1 ^ ^ . 



We now show that 



A(A') ^ 1 . 



Any point a of A' other than o may be put in the shape 
a = Uib[-\- ■ ■ ■ Ujb'j, Uj^O, 



where u^, are integers. Then 



( 2 ) 

(3) 



where 



{fintj)a = VjbiH bvyby 

= (t^/</), Vj = Uj+0 



are integers, since Uj and m jj-nij are integers. By Lemma 2, since Vj-=\= 0, 
we have 





F{fimja) T; Xj. 


Hence 


F(a)> >1 

limj 


This proves (3). 


Finally, 


< d(F), 

d(A') 



(4) 



by the definition of (5(F). The required inequality (1) now follows from 
(2), (3), (4) and the inequahty 







of Lemma 4, Corollary. 

A rather more detailed argument shows that the sign of equality 
in (1) cannot hold if F{x)<\ is a bounded star-body. Then it is possible 
to ensure that there are not n linearly independent points a of A' with 
F(a) =1, so A' cannot be critical, and there is inequality in (4). See 
Rogers (1949a). 

Cassels, Geometry of Numbers 



14 
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VIII. 3.3. We now show that the constant in Theorem II 

cannot be improved. For reasons of algebra we treat only the case 

M = 2. 

For general n see Mahler (1949a) or Chabauty (1949a). 

We first consider a point set which is not a star-body. Denote by 
the set of points 

<g'\ i±i,0) 

and by the set of points 

(s«i,sm2). 

where 

s^l; integers, ^ 2 = 4 = 0 . 

Finally, let ,9^ be the set of points which belong neither to nor 
to Clearly is open, and if any point x is in 6^, then rx is in 9^ 
for 0 ^ I r I ^ 1 : so y” has some of the attributes of a star-body. We shall 
later modify y slightly to obtain a set -5^ which actually is a star- 
body. 

There certainly exist y-admissible lattices A, i.e. lattices having 
only the origin o in y. For example the lattice Aj of points 

(2 , ^ 2 ) / 

where are integers, is y-admissible, since if Mj+O the point 

(2Mi, Mj) is in and if « 2 = 0 , but Mi=t=0, then [2u-^, u^) is in We 
shall next show that 

zi(y) =i(A 2 ) = 2 : ( 1 ) 

that is that every y-admissible lattice A has determinant d (A) ^ 2. 

Let A be any y-admissible lattice. By Minkowski’s convex body 
Theorem II of Chapter III, there is certainly a point x other than o 
of A in 

\x^\^2d{t\), 1 ^ 2 ]^ I-. 

This point is not in y, so must be in or and hence has the shape 

= (^1 1 > 0 ) > 1 =f= 0 . 

We may suppose without loss of generality that is primitive. There 
is then a vector 

f>2 ” (^12» ^ 22 )^^' 

which, with bj, forms a basis. Hence 

^11^22 — d: (A) d= 0 • 




General distance-functions 



211 



Since is in the ^-admissible lattice A, it must be in or so 

^12/^22 = rational. 

Similarly + is in or so (6n + 6i2)/622 is rational; and hence 

ill/62 2 = rational. 

There thus exists a real number |>0 and integers Bn, B12, B22 such 
that 

bi = (^ i^ii, 0) , &2 = (^ Bi 2, f B 22 ) • 

Without loss of generality, Bn, B12 and have no common divisor 
except ±1. 

Let V be the product of the primes which divide B22 but not B^^- 
Put 

Bi 2 = V Bn -f- Bi2. 

We wish to show that B^ is prime to B22' and must distinguish two 
cases for the prime divisors p of B 22- Up does not divide B12, then it 
divides^. If divides B12 then it does not divide Bn, since Bn, B^, B22 
have no non-trivial common divisor; and p does not divide v. In both 
cases p does not divide B^- Hence, on replacing 62 by &2+r’bi. we may 
suppose that B12 and B22 have no common non-trivial divisor. 

Now 62 is in the 5^-admissible lattice A, so is in or Hence 

since B12 and B22 have no common factor. Similarly b^ is in or 

and so 

Hence 

either |Bn|=l, 
or I Bill ^2, 1^1 

In either case, 

^f(A)=|BiiB22f^|^|Bii.“1^2. 

This concludes the proof of (1). 

We denote, as usual, by the set of points 

ux, xe^] 

and by Aq the lattice of points {u^, Mj) with integer u^,U2- Clearly if 
there are no points of Aq except o in if there 

are only the further points (±1,0) of Aq in while if fx>\, the 
points (±1,0) and (0, ±1) are m If were a star-body B(a;)<1, 

14* 
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these statements would imply that the successive minima of Aj were 
^ 1 = 2 “^ ^2 = 1. Hence 

which is the case of equality in Theorem II if is written for 

(5(F), the two being equal for star-bodies. 

It remains now to modify 6X’ so as to obtain a bounded star-body, 
in such a way that its successive minima with respect to Aq remain 
and 1, and so that its lattice constant is arbitrarily close to 2. We 
do this by replacing the lines in and by narrow wedges. 




Let £>0 be arbitrarily small. For any vector y = {yi, y 2 )=t= 0 , let 
(y) be the set of points x for which 

'Kiy)' + e-yxiy^-x^yil^yl + yl (2) 

Then (y) is an infinite wedge having a vertex at y, see Fig. 9. Its 
precise shape is not important. The two sides of the wedge make the 
small angle ± arc tan s with the outward radius vector from o to y. 

Now let be the set of points in (2^, 0) and (—2*, 0) and 
let be the set of points in («j _ u^) for any pair of integers with 

«2=1=0. Finally let be the set of points in 

i®l<e-i (I) 

which do not lie either in or in ‘8’". Clearly is a star-body, since 
there are only a finite number of the wedges composing and 
which have points in common with the disc (3). Indeed, by (2) and 
(3), the distance-function F^{x) associated with may be written down 
explicitly. 
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Since 9’^ is contained in y, and since the points (±2*, 0), (0, ±t) 
are evidently still boundary points of 5*^ , at least when c is small enough, 
it follows that the two minima and of [\ with respect to F^(x) 
are 2“^ and 1 respectively. 

Further, 

Indeed 

lim A(Sr\ =A{^)==2 

e-»-0 + 

by Theorem V of Chapter V, Hence there exist e such that 
2M(FJi(Ao)=2i(zl(5^,))-i 

is arbitrarily close to X^ Aj . This shows that for « = 2 the constant 
24 I«-i) = 2^ in Theorem II cannot be improved. 

VI 1 1. 4. Convex sets. We shall often have occasion to refer to the 
results of §3-I— 3-4 of Chapter IV and in particular to the properties 
of tac-planes. 

We first need a general lemma about convex functions. 

Lamma 5- Let F(x) be a symmetric convex distance function associated 
with a hounded convex body F{x)<i. Let c=t=o and let tt be the plane 
through the origin parallel to a tac-plane at c to F(x) <F(c). Then 

F{y+iasc)^/xF(y + sc) (1) 

for all y in tt, all real s, and all in 

If s=0 there is nothing to prove. Otherwise we may suppose, by 
homogeneity, that 

s = 1 , 

since s~^y is in tt if i/ is. Then 

F{y + c)^F{c), (2) 

by the definition of a tac-plane. Then, by convexity, 

F{y + c)^F(y+fic)+F{{i-fi)c} j 
= F{y + ^c) + {i-f^)F{c). J 

The required inequality (1) with s = 1 now follows from (2) and (3). 
We may now prove 

Theorem III. Let F{x) be a symmetric convex distance-function 
associated with a bounded body F{x)<\ and let X^, .... X„ be the successive 
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minima of a lattice A with respect to F. Then there is a lattice A' with 
determinant 

(4) 

and successive minima A,' (i^j^n), where 

(5) 

Let bj , . . . , 6„ be the basis for A given by Lemma 2. Let ± c be 
the points on the boundary of F(*) < 1 at which the tac-plane is parallel 
to the plane tt through bj , . . . , b„_j (Theorem IV of Chapter IV). Then 
every point in space can be uniquely put in the shape 

x = y + sc, j/€iT. (6) 

We put 

A — * 

and define A' to be the lattice of all points 

y+/isc, jz + sc^A. (7) 

Then (4) clearly holds. If s=(=0 in (7), the point i/+sc is not linearly 
dependent on bj , . . . , b„_j ; and so 

F(y + sc) 

Hence 

F{y+fisc)^/a?.„ = k„_i (s =(= 0) (8) 

by Lemma 5. On the other hand, the points of A' with s=0 are just 
the points of A which are linearly dependent on b^, , b„_i. Hence 
(8) implies (5). 

Corollary 1 . 

?.r^K<d(F)d{A). 

For in the proof of the Theorem put 

A = ^ll^n 



instead of Then (8) becomes 

F(y+ysc)^fi?.„ = ?., (s + O); (8') 

so 

F{a') ^ 

for all a'e A' except o. That is, 

F(A')^^,. 



d[A') =fj.d(A) =i^y{A). 



Further, 



(4') 
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F”(A')^d{F)d{/\') 

by the definition of 6 {F) ; and then the corollary follows from (4') 
and (8'). 

Corollary 2. 

We only sketch the proof. By varying fi in the proof of the theorem, 
we may obtain a lattice A' with successive minima AJ, where 

A'=A,- = 

and 

^(A')^i^p^(A). 

Then 

^1 • ■ • A;t 

d{A) = d{A') ■ 

Hence it is enough to prove the corollary when A„_i = A„. But it is 
easy to see that if two of the numbers rjj in Lemma 4 are equal, then 

the right-hand side of (t) of § 3-t may be replaced by y ~ ~ ~ ■ 

When this improvement is inserted in the proof of Theorem II, it gives 
the corollary. 

VIII. 4. 2. Before treating Minkowski’s estimates for the product of 
the successive minima of a bounded symmetric convex body in terms 
of the volume we must first prove a result, which we shall also use 
later, relating to convex bodies and the quotient space ^/A. We 
shall use the concepts and notation of Chapter VII. As was done 
there, we denote the points of ^ by small bold letters and those of 
iMJA by small gothic letters. 

Theorem IV. Let F(x) be a convex symmetric distance-function 
associated with a hounded convex set 



of volume 



6F\ F(ac)<1 

V,= V{^). 



(f) 

( 2 ) 



Let A be a lattice with successive minima , . . . , A„ with respect to F. 
For real t>0 denote by S[t) the set of which have at least one 

representative y in t£A (i.e. F(y)<t). Then the measure m{S(f)} of S(i) 
satisfies the inequality 

= t”Vp if 

^(iAd...(p^)r--^Fp if 
if 






( 3 ) 
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We first examine how the hypotheses and conclusion are affected 
by a homogeneous linear transformation t. Let A' = tA, F'(x) =F{x^^x). 
The successive minima of A' with respect to F' are the same as those 
of A with respect to F. Clearly 

F^, = |det(T)|F^, 

and by the remarks at the end of § 2.5 of Chapter VII we have 
m' {t S (f)} = I det(x) I m{S(f)}. 

where TS(f) is the image of S(t) in the natural mapping of ^/A onto 
^/tA; and m' is the measure in ^/tA. But TS(f) =S'(t) is the set in 
^/tA defined in respect of F' and A' as S(t) was defined in terms of 
F and A. Hence a homogeneous linear transformation multiplies both 
sides of (3) by the same factor |det(x)|. 

We may therefore suppose without loss of generality that the basis 
bj, 5„ for A given by Lemma 2 is just 



By = (o, ...,0,1,0, 



(4) 



and that A = Aq is the lattice of points with integer coordinates. 

We now obtain a formula for m {S (/)} valid when 

(5) 

and 7 = 1,2, ...,« — 1. Let 

~ (%1> •••• ^nl)> ®2 ■■■I 

be two points of F{x)<t^^kj+i', suppose that 



Then 



Xi — Xj (^o) • 

F(Xi - Xj) ^ F(Xi) + F(Xij) <Xj+i. 



Since Xj — Xg 6 Aq , we have now 



( 6 ) 



*,i = x,-2 (j > J) , (7) 

by (4). Further, 

{Xii, . , Xji) = {Xi2, • , XJ 2 ) (A^) , (8) 

where A^ is the /-dimensional lattice of points with integral co-ordinates. 
Clearly (7) and (8) together imply (6). Denote by the /-dimensional 
euclidean space and by n\j the measure in ^y/A^. For given [n-J)- 
dimensional vector « = (zy , . . . , , denote by Sy [t, z) the set of points 

of which contain representatives (% , . . . , Xy) e such that 

F(Xi,...,Xy,Zy + i,...,Zj<L (9) 

Then we assert that (5) implies 

m{S[t)] = S mj{Sj{t,z)}dz [dz = dzj^^. . . dz„) . 



( 10 ) 
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In the first place, Sj(t,z) certainly has a /-dimensional measure, since 
F(x) is continuous by its definition as a distance function. Then, if z 
runs through all (« — /) -dimensional space and J/ = (yi, - yj) runs for 
each z through a complete set of representatives for Sj(t, z), it follows 
from the equivalence of (6) to (7) and (8), that 

^ = iVl yj.^j+l ^n) 

runs through a complete set of representatives for S{t). We may, for 
example, normalize the y by taking always 0^yy<t This 

proves (10). 

The next stage is to show that if s is any number ^ 1 , so 

Q<i^st, (11) 

ir>j{Sj{st.sz)}^mj{Sj{t.z)} (12) 

for any (« — /) -dimensional vector z. This is certainly true if the right- 
hand side of (12) is 0. Otherwise, there is some /-dimensional vector 
?/o=(yio. •••. y;o) such that 

F(yo.^)<i 

where, in an obvious notation, ^) = (yio- • ••> y/o- • • •. ^„) : and 
similarly later. Let y be any /-dimensional vector with 

F{y, z)<t. 

Then by the convexity and homogeneity of F{x), we have 
P{y + (s - 1) J/o. sz]=F{{y, ») -f (s - 1) (^0. «)} 
^F{y,z)F[s-\)F{y^,z) 

<f-f (s -1)f 
= st. 

Hence, if y runs through a complete set of representatives for Sj{t, z), 
then y -|- (s — 1)j/o runs through representatives of distinct elements^ of 
Sj{st, sz), when y^ is kept fixed. This proves (12). 

Suppose, now, that 

(13) 

Then, by (10) and (12) we have 

m{S(si)} = / mj{Sj{st,z)} dz 

= s’'-J f mj{Sj{st,sz)}dz 
J mj{Sj{t,z)}dz 

=s"Wm{S(/)}, 

1 Of course not every element of Sj{st,sz) necessarily has a representative 
of the type J/+(5— l)t/o- What is important, is that distinct mod give 
distinct y+{s— 1)j/q mod AJ. 
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where in the second line we have replaced z hy sz and in the third 
line we have used (12). 

When 

(15) 

we have the simple equation 

m{S{t)) = V{t 9 ’) (16) 

where tS^ is the set F{x)<t. Indeed, if and x^ are any two points 
of with x^=x^ (Aq), we have 

F(Xy - Xi) ^ F{Xj) + F{X2) < 2f ^ Ai ; 

and so x^ = X2- 

We may now prove (3). For the truth of (3) follows from 

(1 6) . Suppose that (3) is already proved for f ^ , where 1 ^ ^ n — 1 . 

Its truth in the range then follows from (13) and (14) 

with t = \Xj. Finally, the truth of (2) for i^^X„ is trivial, since (ii) 
includes ^"(^2) if h = ^2- ^nd hence m(S(<)} increases with i. 

VIII. 4.3. Theorem IV provides the kernel of the proof of the follow- 
ing theorem of Minkowski. 

Theorem V. Le( F(x) he a symmetric convex distance-function as- 
sociated with the bounded set F[x)<\ of volume Vp. Let X^, ..., A„ he the 
successive minima of a lattice A with respect to F. Then 

^d[N)^X,...X„Vp^2”d(h). ( 1 ) 

In Theorem IV the measure m{S(f)} for any t can be at most the 
measure of the whole space namely d (A). On applying this remark 
when t = jX„ to the inequality (3) of §4.2 we get the right-hand side 
of (1) at once. 

Now let Oj, ..., o„ be the linearly independent points of A with 

F(a^j =Ay 

given by Lemma 1. By the homogeneity and convexity of F(x), all 
points 

X =tiai + --- -ht„a„ (2) 

such that 

(3) 

lie in F(ac)<l. Hence F' where V is the volume of the set of (2) 
subject to (3). But clearly 

F' = |;idet(a„...,a,.)l =^i<f(A), 



(4) 
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where 1 is the index of a^, o„ in A. This proves the left-hand side 
of (1 ) , since 7^1. 

Corollary. The index I of a^, a, ^ is at most n\. 

This follows from (4) and the right-hand side of (1). (Compare the 
proof of Theorem X of Chapter V.) 

VIII. 5. Polar convex bodies. Let A* and F* be the respective polars 
of the lattice A (Chapter I, § 5) and the symmetric convex distance- 
function F (Chapter IV, §3). Mahler (1939b) has shown that the 
successive minima of A* with respect to F* are determined by the suc- 
cessive minima of A with respect to F apart from factors which have 
bounds depending only on the dimension n. Thus relationship will be 
exploited in Chapter XI in the discussion of inhomogeneous problems 
and is of importance in other contexts. The theorem is, of course, 
closely related to Theorem VI of Chapter IV dealing with the lattice 
constants of mutually polar convex bodies. 

Theorem VI. Let Ai, ..., be the successive minima of a lattice A 
with respect to the symmetric convex distance-function F and let X*, 
be the successive minima of the polar lattice A* with respect to the distance- 
function F* polar to F. Then 

(1^/^n). (1) 

We attack first the left-hand inequality. By Lemma 1 there exist 
linearly independent vectors Oy, a* of A and A* respectively such that 

F(a,)=Ay, F*(a*)=X*. (2) 

By Theorem III of Chapter IV we have 

F{x) F*{x*) ^ XX* 

(scalar product) for any two vectors x and x*. On applying this 
a; = ±o, , x*= for any pair of indices i, j we have 

X,X*^\a,af\, (3) 

since F{x) and F*{x) are symmetric. But a^af is an integer by Lemma 5 
of Chapter 1 , and so 

either A, A* S 1 or a, a* = 0 . (4) 

Let 7 be a fixed index. The vectors x such that xa* =0 (l^t^7) 
form an [n — 7) -dimensional subspace. Hence by the linear independence 
of the Oy there is some Oy with /^n-(-l —7 which does not lie in this 
subspace; that is 



fly fly =j= 0 
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for some i, j with 

Then Ay^A„ + i_;, and so, by (4), 

A„^i_;A?^A,A?^1. 

Since this is true for any I, this gives the left-hand inequality of (1). 

We now prove the right-hand inequality in the enunciation. Let 
a, be as above. Then (cf. Chapter I, § 5) there are n primitive 

vectors h* of A* such that 

a,b*=0 (f#=7). (5) 

Since the a, are linearly independent, the n equations a,a;* = 0 are 
satisfied only by x* = o: and so 

tt; 6f =1= 0 (1 ^ f ^ «) . (6) 

Hence the b* are hnearly independent. 

By Theorem III, CoroUary 1 of Chapter IV, there are vectors Xj 
such that 

F(x,)F*(6f)=x,5f. (7) 

Without loss of generality 

x^bf=i (8) 

The next stage is to show that for fixed J the determinant Dj formed 

from Xj and the a, (f =f= /) has absolute value at least i(A). For fixed /, 
there is a basis for A* with 

cj = (9) 

Let C; be the polar basis, so that, by (5) and (9), 

«. = 2 ^,i^i (» + /) Oo) 

for some integers Further, 

Xj=±c„+ 2 i,c,- 

for some real numbers tj, by (8) and (9)- Hence 

Z)y = |det(aj a,_i,Xj.aj^^, ...,a„)\ = \det | det (c,, . . ., c„)| . 
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The first factor here is a non-zero integer since the are linearly in- 
dependent; the second factor is just d{A). Thus 



as required. 
The points 

with 



Dj^d[A), 



tjXj + 2 
•'+/ 

tj\F{Xj) + Z\ii\F{a,)<i 



(11) 



lie all in the set F{x)<\ of volume Vp. This set of points has volume 



^Dj[F{Xj)UF[a,)y\ 

Hence, and by (11), 

VpF{Xj)UF[a,)^^d[A). (12) 

.+/ 

But F(a,)=A, and Vp[J Xi^2”d{A) by Theorem V, so 

i 

F(Xj) ^ Xjjnl, 

and finally 

F*(bf)^nnj\ ( 13 ) 

by (7), (8). The inequality (I 3 ) holds for each integer / and for the 
independent vectors bj of A*. 

Now ^ and so, for each integer /, there are the «-f 1 — / 

linearly independent vectors 5* = 6* (J^j^n) of A* such that F[b*) 
^m!A 7‘. By the definition of AJ+i_^ it follows that 

+ ^ Ay 

This is the required inequahty and so concludes the proof of the theorem. 

The applications of the theorem are usually only qualitative so the 
magnitude of the factor n\ on the right-hand side is usually irrelevant. 
Mahler (1939 b) showed that the weaker inequality 



can be deduced very simply from the left-hand inequalities. Theorem V 
and Theorem VI of Chapter IV. We have 



FyAi...A„^2”i(A), 
VpAt ...Xt^2’'d{A*), 
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and so 



Now 






d(A)ci{A*) 

by Lemma 5 of Chapter I, and 



VpVp,^. 



(«!)2 



by Theorem VI of Chapter IV. Further, 

i 

for any particular J by the left-hand inequality of Theorem VI. Hence 
XjX*_j^[n\Y, as required. 

VIII. 5 . 2 . In Chapter XI we shall also need the following result of 
which the proof is similar to that of Theorem VI. 

Theorem VII. Let F(x) and F*[x) be polar symmetric convex distance 
functions. Let bj, .... be any basis of a lattice A and b*, .... 5 * the 
polar basis of the polar lattice A*. Then 

2’'d{A)F*{bJ)^n\VpYlP{bi) (1) 

i + J 

for each integer J = \,2, ... ,n. 

For the deduction of (12) from ( 5 ) and (6) in § 5-1 did not depend 
on the fact that the Oy gave the successive minima for F. Hence (12) 
of § 5.1 remains true if by is read for Oy, where Xj is to be given by (7) 
and (8) of § 5 . 1 . On substituting ( 7 ) and (8) into (12) of § 5-1 the required 
result follows. 

Corollary [M. Riesz (1936a), K. Mahler (1939a, b)]. LetX^,...,X„ 
be the successive minima of F with respect to A. Then the basis by may be 
chosen so that 

j 

2F{b,) ^ J Ay (2^'i^n) I 

and 

F(by)F*(bf)^(i)'-i(«!) 2 . ( 3 ) 



The existence of a basis by satisfying (2) follows at once from Lemma 8 
of Chapter V on defining Oy there to be the linearly independent points 
with F(Oy) =Ay. But now on multiplying (1) by F(by) and using Theo- 
rem V, we have 

2''d[A) F{bj) F*{bJ) ^nWpTl F{b^) g (i)"-*(« '-YVp JJ ^2{n\Yd{A). 

lg;gn lg;^n 
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Chapter IX 

Packings 

IX. 1. Introduction. If is any tt-dimensional set and y a point, 
we denote by +y the set of points 

y + y: x + y, (1) 

By a packing of y in some other set y we shall mean a collection of 
.sets 

^r=^+Vr. ( 2 ) 

each of which is contained in y, and no two of which have points in 
common. If y is the whole space we speak simply of a packing 
of y. If the y^ in (2) run through the points of a lattice A then we 
say that the packing is a lattice packing. In this chapter we examine the 
consequences of these ideas for the geometry of numbers. This chapter may 
be regarded as a sequel of Chapter III but we shall also require some of 
the general properties of convex bodies discussed in Chapter IV. We 
shall find that the general theory of packings is relevant even to strictly 
lattice-theoretic problems. 

There is an admirable account of the theory of packing in Fejes 
Toth (1953 a) and a conspectus of the more important results in 
Bambah and Rogers (1952a). 

IX. 1.2. The three following theorems show the relevance of packings 
to the theory of Chapter III. We give the simple proofs here 

Theorem I. A necessary and sufficient condition that the lattice A 
give a packing of the set y is that no difference x^—x^ of two distinct 
points of y belong to A. 

Suppose, first, that x^ — x^ — a^/K. Then the sets y = y+o and 
y + a both contain the point = and so overlap. Conversely, 

suppose that the sets y +aj and y -f-Oj have the point y in common 
where %, Oj are in A. Then the two points y — a^ = Xi, y — a^^x^ are 
in y, and their difference Oj— Oj is in A. 

Blichfeldt’s Theorem I of Chapter III shows that 

F(y)gi(A) 

whenever A packs y. The following theorem shows when the sign of 
equality can occur. To avoid irrelevant topological considerations we 
confine attention to rather special sets y. 

Theorem II. Let LA be a bounded open star-body and A a lattice with 

F(y)=^(A). (1) 
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(A) If A packs , then every point in space either belongs to precisely 
one set y -\- a, aaA and is not a boundary point of any other y +a, 
or is a boundary point of at least two such sets y +a. 

(B) If every point of space either belongs to or is a boundary point of 
at least one set y + a, then A packs y . 

By hypothesis, there is an R such that y is contained in 

1*1 <R. 

We now prove (A). Suppose, first, that A packs y and that there 
is some point y which is not in or on the boundary of any y +a, aeA. 
We may choose e in the range 0<£<1, so small, that the sphere 
of points X with 

^1- |*-y|<£ (2) 

is completely outside the finite number of bodies y+a with agA and 
|a — y|<f?-|-1. By the definition of i?, the set y + a certainly contains 
no points x of if |a — + We may suppose, further, that e 
is so small that the only point of A in |*| <2e is o. Let 

y' = y^y^ 

be the set of points belonging to either y or ,5^. Clearly, if and x^ 
are distinct points of y' the difference x^—x^ cannot belong to A. 
Hence 

F(y')^^f(A) 

by Blichfeldt’s Theorem I of Chapter III. But then F(y)<F(y'), 
which contradicts the hypothesis. Suppose now that the hypotheses 
of (A) are fulfilled and that there is a point y which is on the boundary 
of precisely one y +a, a 6 A. Suppose, without loss of generality, 
that y is on the boundary of y. As before, there is an £>0 such that 

defined in (2) contains no point or boundary point of any y+a 
with aeA, a =1=0. But then the point {\+rj)y, for sufficiently small 
rj>0, is in and is not a point or boundary point of y. On taking 
(1 +rf)y instead of y, we thus have the case first considered. No point 
can belong to more than one y+ a, ae A by the definition of a packing. 
If y were a point of y+a and a boundary point of y+6, where 
a, be A, then there would be points in the neighbourhood of y in both 
y+a and y + 6, since y is open. This completes the proof of (A). 

We now prove (B). If y is not packed, then, by Theorem I, there 
are points and *2 such that 

o + ao = *j — *2 6 A . 
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Since ^ is open, by hypothesis, there is an e>0 such that both spheres 

^ : I a; — a?! I < e , 

. j a? 3^2 j e 

are contained in y”. We may suppose that e is so small that ^ and 9’^ 
have no points in common. Let i/” be the set of points which belong 
to y but not to Clearly every point in space is either an inner 
point or a boundary point of ^'+a for some a 6 A, since every point 
of ^ is either in S/” or in ,9^'+ a,. Let S/” be the closure of y". Since 
y” is a star-body and ,5^ is a subset of if , we have 

V[^') = F(y”) < V{6^) = i(A) . 

This is a contradiction with the Corollary to Theorem I of Chapter III 
since we are supposing that every point, and so every point of the 
fundamental parallelogram, is of the form «+a where and a 6 A. 

This completes the proof of Theorem II. 

Theorem III. A necessary and sufficient condition that the convex 
symmetric set ^ admit the lattice A is that A give a lattice packing of 

This follows at once from Theorem I and Theorem II, Corollary of 
Chapter III. 

We shall consider only packings of convex sets in what follows, 
and we shall suppose that is symmetric, whenever this gives any 
simplification of proofs or results. 

IX. 1.3. Minkowski’s convex body Theorem II of Chapter III states 
that if is an «-dimensional symmetric convex body of volume 
V(y) >2”d{A), then the lattice A cannot be .9^-admissible. In § 2 we 
discuss when a lattice can be admissible for a convex symmetric 
body of volume 2”d(A^). Of course then by Minkowski’s convex body 
theorem we have 

A(AA) =2~’'V{6A), (t) 

and the lattice A^ is critical. 

Even when SA is the cube |xy| < t (1 the critical lattices were 

not completely known until Hajos (1942) confirmed on old conjecture 
of Minkowski. We quote the result here, but shall not prove it since 
it depends on considerations of group-algebra remote from the other 
topics in the book. 

Theorem IV. A necessary and sufficient condition that a lattice A 
be critical for \x ;l<^ (l^/^«) is that, after a suitable permutation of 

Cassels, Geometry of Numbers 15 
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the axes of co-ordinates, it has a basis of the shape 

bi = (l,0 0) 

^ 2 =(^ 12 >^> 0 , •••) 0 ) 



&«=(*!» K-l.n> 1 )- • 

The reader will readily verify that a lattice of the stated kind has 
determinant t and no points other than o in (\^j^n). For 

the proof of the converse the reader is referred to the original paper of 
Hajos (1942) and to Redei (1955 a) where there are references to the 
considerable amount of later literature. We proved Haj6s’ Theorem for 
n=2 incidentally as Lemma 7 of Chapter III. 

Minkowski (1896a) showed that any convex symmetric set 6^ with 
A{y) = 2~"V{6^) must have very special properties, for example that 
it must be a polyhedron bounded by at most 2”— 1 pairs of hyperplane 
faces. We prove this in § 2. 

IX.1.4. VoRONo'i (1908a) suggested a simple way of finding open 
convex symmetric sets y such that 

F(y)=^(A) 

and which are packed by a given lattice A. If g{x) is any positive 
definite quadratic form, the set of points such that 

g(ac)<infg(a:+a) 

a6A 

« =# o 

has this property. The condition 

g(x) < g{x + a) , 

for any given a, is linear in the coefficients x^, x„; so y is convex, 
y is clearly symmetric. It is not difficult to verify that y is, in fact, 
bounded; and that then the infimum in (1) may be replaced by a 
minimum over a finite number of a depending on A and the function 
g{x), but not otherwise on the individual x. Not every open convex 
symmetric y with F(y)=2”cf(A) for which A is admissible may be 
obtained in this way, but Voronoi was able to show that all, in a 
sense, sufficiently general such y could be. Unfortunately the excluded 
cases include some of great interest, such as those covered by Haj6s's 
Theorem IV. 

We do not discuss the case of general dimension n in this book but 
deal in detail with w=2 in §3. As a byproduct we obtain a result 
about the inhomogeneous problem for definite binary quadratics. 
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IX. 1.5. Let JT be any open 2-dimensional set and the 3 -dimen- 
sional set of points 

(x^,X^,Xs) (Xi,X2)eJf |«3|<'l; (1) 

that is, a generalized cylinder of height 2 and with cross-section JT. 
Then a plane section 

%3 = constant 

of a lattice packing of 'if gives, in an obvious way, a packing of JT, 
but not necessarily a lattice packing. The idea of using non-lattice 
packings in this context is apparently Mahler’s (1946g). In this way 
we are led to consider non-lattice packings of 2-dimensional sets. This 
we do in § 5, after some preparatory lemmas in §4. It turns out, as 
was proved independently by Rogers (1951a) and Fejes Toth (1950a) 
[see also Fejes Toth (1953a)] that, in a sense which will be made 
precise, no packing of convex symmetric open sets is closer than the 
closest lattice packing. It appears unhkely that this result extends to 
higher dimensions. For a discussion of this point see Fejes Toth 

(1953a). 

In § 6 we use the packing results to show that 

d('g') =Zl(Jf), (2) 

when jT is convex and symmetric and is defined in (1). This result 
was originally proved independently by Chalk and Rogers (1948a) 
and Yeh (1948a). An example was given by Rogers (1949b) which 
shows that (2) need not hold when JT is a symmetric non-convex 
2-dimensional star-body, and Davenport and Rogers (1950b) gave an 
example to show that then the ratio A(if)IA{Jf) may be arbitrary small. 
Varnavides (1948a) has shown that (2) continues to hold in one inter- 
esting non-convex case. It is trivial that A{'if)-^A{Jf) for any JT, since 
if A is a 2-dimensional admissible lattice for"^, the 3 -dimensional lattice 
of points 

{x^.x^.x^) (%,V 2 )€A, Xa = integer 

is clearly admissible for and has the same determinant as A. 

There is an interesting unsolved problem in this connection. Let 
Jfi and JTj be convex symmetric bodies in and n.^ dimensions respec- 
tively and let be the (M^-fMj) -dimensional “topological product” of 
Jf] and JTj; that is the set of points 

x = {y,z), seJfa- 

The argument above shows that 

ACif)^A{X,)A(jr,). (3) 



15 ‘ 
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Can it ever happen that there is strict inequality here ? The cylinder 
is, of course, the case «j= 2 , «2 = 1. Woods (1958 a) has shown that there 
is equality in ( 3 ) for «i = 3 , «2=1 when a. 3 -dimensional sphere. 

IX. 1 . 6 . In §§ 7, 8 are given applications by Blichfeldt techniques 
based on packing considerations, or at least Blichfeldt’s Theorem I 
of Chapter III, to the estimation of the lattice constants of the sets 

^1 + ■ • ■ + < 1 



respectively. The relationship of Blichfeldt’s results to later work 
will be discussed there. 

IX. 2 . Sets vuith V{y) =2”A(6^). We prove here the following result 
of Minkowski (1896a). 

Theorem V. Let he an open symmetric n-dimensional convex set 
which admits a lattice A with d(A) = 2~”V{y). Then y is defined by 
m ^ 2” — 1 inequalities^ of the shape 

(f) 

I j \ 

For each I (1 w) the planes 

2//,'^/=±l (2) 

i 

give an {n — i )-dimensional pair of faces of ^ , and each such face 
contains a point of A as an inner point (i.e. for each I there are lattice 
points satisfying (2), and (1) for i=^I). 

By Lemma 4 of Chapter IV the set , 9 ” is bounded since 0 < Ffy”) < oo . 
By Theorems II and III, every point either belongs to precisely 
one set 

J"(a): 1 . 5 ^ + a, aeA, 

in which case it is not a boundary point of any y (6), 66 A or it is a 
boundary point of at least two ^{a). Hence every boundary point of 
^(o) = is also a boundary point of some ^{a), a=)=o: and, by the 
boundedness of y, only a finite number of a can occur in this way. 

We note now that, for fixed a, the set of points which are on the 
boundary of both ^(o) and ^(a) is convex. For if x, y are two such 
points, the point 

tx + (.i-t)y {0 <t<i) ( 3 ) 

1 In fact there are at most 3" — 3 faces [Groemer, MZ 79 (1962) 364 — 375]. 
both and its faces are centrally symmetric. Estimate 3“ — 1 is easy (Hlawka, 
1949a). Both Groemer and Hlawka give generalizations. 
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is certainly either a boundary point of .^(a) or belongs to ^(a) by 
convexity, and similarly for ^(o). Hence (3) is a boundary point of 
both ^(o) and ^(a) by Theorem II. 

In particular, if z is common to the boundaries of ^(o) and ^(a) 
then so is^ a — z by the symmetry of Hence so is also 

ia =i« + i(a-«) 

a common boundary point. 

Denote by 

±c, (i£k^K) (4) 

the points c of A such that the boundary of ^(c) has n linearly inde- 
pendent points in common® with that of .^(o), and denote by 



± 6 ; 



( 5 ) 



the remaining points b of A such that the boundaries of .^(b) and 
,^(0) have points in common. From what has just been shown, the 
points common to the boundaries of ^{0} and .^(b,) lie in a linear 
subspace of dimension at most n — 2 (not, of course, necessarily, passing 
through the origin. In fact, it cannot pass through the origin). 

We show now that every boundary point z of ^{o) is also a boundary 
point of a ^(c*). The set of boundary points x of ^{o) in any neigh- 
bourhood 

1 « — »!<£ ( 6 ) 

of z is (« — t)-dimensional, and so cannot be exhausted by the at most 
(m — 2) -dimensional sets of boundary points in common with the ^(b,). 
Hence there must be points in (6) which are common boundary points 
of ^(o) and a ^{c^. Thus z itself is a boundary point with a 
as required, since there are only a finite number of c^. 

[More precisely, let y be F{x)< 1, where F{x) is a distance-function. We may 
suppose, without loss of generality, that *= (1, 0, ..., 0). If « is common to the 
boundary of y(o) and y(bi), the common boundary points of y{o) and .^(bj) 
satisfy at least two distinct equations 



and so at least one equation 



1 ) + 0 , 

1^2 



)S2 



There is an equation of this type for each I for which * is on the boundary of .^(bj) . 
If 4^2 , ..., are chosen so as not to satisfy any of these conditions, and arbitrarily 

1 — * is on the boundary of y(o), by symmetry, and then a — z is on the 
boundary of y{a). 

® That is, the common boundary of y{o) and .^(c) is a convex {n — l)-dimen- 
sional set with centre ic, by what has been already proved. 
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small, then the point 

■J {-F(l, ^2 ^n)} ^ (t» ^2» • • • > ^n) 

is arbitrarily close to z and not on the boundary of any S~(bi).] 

Now we consider the boundary common to ^{o) and ^(c*). We 
saw already that is one point of the common boundary. Let 

-2-c*, (7) 

be n linearly independent points on the common boundary. (They exist 
by the definition of the c*.) Then the points 

\<^k-yki 

are also on the common boundary, by symmetry; and hence, by con- 
vexity so are all points 



C*+ 2 ijUki 

lg;gn-l 


(8) 




(8') 



Let TT* be the (hyper) plane through and the \Cf,±_y^y Clearly any 
plane other than through contains points of ^{o)\ and so tt* 
must be the only tac-plane to ^{o) at \c^. The equation of tt^ may be 
written in the shape 

ZlkjXj = \. (9) 

IS/Sn 

since tt* cannot pass through the inner point o of ^{o). The correspond- 
ing tac-plane — tt* through — is obtained by changing the sign of 
the /*y in (9). Hence every point of the open set ^{o) satisfies the 
inequalities 

\Ztk,Xi\<\- ('to) 

Further, every point y, which does not belong to .T{o) is of the 
shape y = ty^, where 1 and j/q is a boundary point. We saw already 
that every boundary point of ^{o) is also a boundary point to some 

± Cj) and so satisfies 

for this k. Hence y^, and a fortiori y ; cannot satisfy (10). Thus ^(o) 
is precisely the set of x which satisfy (10). Since S^ = 2^{o), the 
corresponding equations for are (1). 

* The point (8) is 

‘o(kf^k)+ 2 \t,-\iick±ykj). 

where the + prefixed to is the sign of tj, and 

<0+ Ihl d h = 1- 
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Some of the inequalities (10) may be identical, since it is quite 
possible that the pairs of tac-planes ± it* may be the same for distinct k. 
We may suppose that (10) for l^^^w gives a complete set of distinct 
inequalities, where m^K. We saw that there is a unique tac-plane at 
c*, and so, since the planes ±iT( (1^/^w, l=^k) are certainly tac- 
planes and are distinct from tt,^, they cannot pass through c,,. Hence 
a: = c* satisfies 

|2//y^/|<i l=¥k), (11) 

' i ' 

and 

I Zj ~ “2"- (12) 

To complete the proof of the theorem, it remains to show that 
w^2"— 1. As in the proof of Theorem IX of Chapter V, it is enough 
to show that the points are not in A for But 

from what has just been proved, the point 2 (^4— c,) certainly satisfies 
[ 2 /i , I < t , for 1 ^ ^ w, there being strict inequality iox I =k,r because 

then (11) holds for x=lc,., respectively. Hence -J (c*— c,)ey’, so 
cannot be in A, since A is ^-admissible by hypothesis. 

IX. 2. 2. When « =2, it is possible to specify completely the convex 
symmetric sets with A(^) 

Theorem VI. A necessary and sufficient condition that the lattice A 
be admissible for the convex open symmetric 2-dimensional set ^ with 

V[6A)=Ad[N) 

is that either 

(i) y is a parallelogram and A is generated by a mid-point of one 
side and a point on one of the other pair of sides or 

(ii) SA is a hexagon and A is the lattice generated by the mid-points 
of any two non-opposite sides. Then A contains the mid-points of all the 
sides. 

That ,5^ is a parallelogram or hexagon follows from Theorem V, 
since 2”— 1=3 for n = 2. The lattices A are critical by Minkowski’s 
convex body theorem. The critical lattices of parallelograms and 
hexagons have already been determined in Lemma 7 of Chapter III and 
Lemma I 3 of Chapter V respectively. 

IX. 3. VORONOi’s results. We already saw in § 1 that if g{x) is a 
positive definite quadratic form and A a lattice, then the set of points 
such that 

?(*) < inf g(aj -b a) 

a€A 

=♦=0 
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form a convex symmetric body ^ of volume 2”d{N). We shall show 
that when w=2 every symmetric convex hexagon and its unique 
critical lattice can be related in this way by a suitable quadratic form 
g(x). On the other hand, if y* is a parallelogram, then A must be the 
particular critical lattice generated by the mid-points of the sides. 

These results are clearly invariant under homogeneous linear trans- 
formation so we may suppose without loss of generahty that A = Aq 
is the lattice of points with integral co-ordinates and that 

g(xi, Xj) =axi + 2hXiX2 + bxl 
is reduced, in the sense that 

Q^-2h^a^b. (1) 

If «i, are integers not both 0, the condition 



<g(*i-«i. ^2-«2) (2) 

is 

2{mj (a Xi + a Xa) -f (/« Xj -f 6 x^)} < gK , Mj) . 



Since ( — «i, — « 2 ) occurs as well as {u^, u^), we thus have the infinitely 
many conditions 

2|«iZi-l-M2^2l <gK-«2). (3) 

where 



Wj = fl Xj -f A Xj , X^ = hx^-\- b x^. 

In particular, 

2|Xi|<a 

2\X2\<b 

2 1 Xj -)- W jI < a -j- 2/( + i = c (say), 

where 



(4) 

(5) 



0<a^b^c^a-\-b. 



(!') 



The set Jf defined by (5) is a proper hexagon unless h = 0, when it 
degenerates into a parallelogram. The area V of is readily com- 
puted from (4) and (5) to be 



V(je^)=4=4d(\). 



But y is a subset of Jt" and V(^) =4, by Theorem II. Hence y =.??’, 
since both are open. This implies that the infinitely many inequahties 
(3) all follow from (5), which the reader may verify directly with httle 
trouble. 

Further, every non-degenerate convex symmetric hexagon with 
its critical lattice may be generated in this way, as we now show. The 
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hexagon is given by three inequalities 

(/ = '!. 2, 3). (6) 

where 

h = iki> hj) 

and 

X — y “h ^2 j ^2 

is the scalar product. The three 2-dimensional vectors ly are linearly 
dependent and, by multiplying them by suitable factors, we may suppose 
without loss of generality that 

^1 + ^2 3 “ ^ 0 , 

and, on re-indexing, that 

^2 — ^3- (7) 

On taking X^ = ljX (/ = t, 2), the inequalities (6) become 

] 

\X,\<k, (8) 

+ </%3. J 

Further, 

^3 <C ^1 -f- k^, 

since the hexagon 3^ is not degeneratic, by hypothesis. VVe may identify 
(8) and (5) by putting 

2k^ = a, 2 k 2 = b, 2k^ = c = a 2h b , 

though of course the %, in (4) are not necessarily to be identified 
with the in (6) . Let x'^ , x'^ be defined in terms of X^ , by the 

analogue 

Xi = a x'l h x'2, X2 = h x'l b x'2 

of (4). On comparing with the earlier part of this section, we see that 
the unique critical lattice of must be given by integral values of 
x[, x'2. We may thus suppose, without loss of generality, that (x'l, x'2) 
was in fact the original co-ordinate system {xi, x^, and then we have 
the situation already discussed. 

IX. 3. 2. From the results of § 3 .I we deduce the so-called “hexagon- 
lemma” of Dirichlet^ which illustrates the connection between homo- 
geneous and inhomogeneous problems that will be discussed in more 
detail in Chapter XI. 

’ For an alternative derivation of the lemma and a partial generalization to 
n dimensions, see Mordell (1956a). 
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Theorem VII. Let 

g[Xi,x^) = ax\ + 2hXiX^-\-bx\ ( 1 ) 

be a quadratic form, reduced in the sense that 

0^-2 h^a^b. (2) 

Then to every real point »o=(^io> ^20) ® point m = (mj, m,) with 

integer co-ordinates, such that 

d{ab — h!‘‘)g{Xf, + u)^abc, c = a-{-2h + b. (3) 

The sign of equality is required when and only when 

2{ab — K^) (£Co + = ± [b(a ^h), —a[b + h)], (4) 

where v has integral co-ordinates. 

For by the results of § 3.I and by Theorem II there is certainly a 
point ®q + m with integral u in the closed hexagon 

2|Zi|ga, 21X21^6, 2 |Xi + X2|^c, 

where 

Xj == flXj + /tXj, X2 = hx^ + bx^. (5) 

But the positive definite quadratic form g{x) can reach its maximum 
in only at the vertices^ of 3^. It is now readily verified that the 
vertices are of the shape (4) and that the value of g {x) at all the vertices 
is given by the right-hand side of (3). The calculations are facilitated 
by the identity 

g{X„-X,)=(ab-h^)g(x), 
where Xj, Xj are given by (5). 

Finally, the ^ in (3) cannot be replaced by < if ajg is any vertex 
of J^, since 

g{x) = infg(a;-f w) 

US A, 

for the points x of This last remark also shows that it was sufficient 
to compute g(x) at any one vertex x^ (say) since, from the nature of a 
critical lattice, all the other vertices are of the shape ±®i + w. where w 
has integral co-ordinates. 

IX. 3.3. Theorem VII gives yet another proof of the result that a 
definite ternary quadratic form f{x) represents an number ag(2D)ifor 
integral values of the variables not all 0, where D is the determinant 
of f{x) (§ 3.4 of Chapter II). We may suppose, without loss of generality, 

* Perhaps the easiest way to see this is to make a homogeneous linear trans- 
formation y = rx so that g(af!) = 1 when it is obvious. 
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that /(ac) is reduced in Minkowski’s sense (cf. Chapter II, § 2.1). We have 

f{x) =ax\ + hx\ + cx\-\-2hx-i^Xi+2gx^x^-\-2fx^x^ ] 

c t (^) 

= a{xi + xxsf + 2h(xi + a. x^) (x^ + Pxs) +b{x2 + fi x^f + yxll 
for some a, /?, y. We may suppose that h^O. Then 



0^-2h^a^b, 

and 

/(mi,«2, 1) S 6 



( 2 ) 

(3) 



for all integers u-^,u^, by the condition of the reduction. 
Theorem VII, we may choose %, Mj so that 



1) S 

Hence from (1), (3), (4) we have 



oi)(a + 2A + 6) 



+ y. 



But now, by 



(4) 



4D = 4{ab ~ h^)y^ 4b{ab — h^) — ab {a 4- 2h + b) 



Now 

by (2) ; and so 



■b(2h + Xa)^ + 'iab^-^aH. 

\2h + J-a) g la, 

4D ^ }ab^ — a^b^ 2ab^'^ 2a®, 



by a further application of (2). This is the required result. Further, 
using the knowledge of the cases of equality in Theorem VII, it is easily 
verified that 2D = o4 can occur only for forms equivalent to multiples 
of the critical form 

^1 + ^2 + ^3 — % «2 — ^2 ^3 — X3X1 . 

IX. 4. Preparatory lemmas. In §§ 5, 6 we shall need three lemmas, 
each of independent interest, which it is convenient to prove first. We 
use the word polygon to mean indifferently a 2-dimensional set bounded 
by a finite number of line-segments or the boundary of such a set. 
Which is meant will be clear from the context. We shall say that a 
convex polygon is circumscribed to a convex set JT if it contains Jf 
and if every side of the polygon is a tac-line® of JT. The first lemma is 
an analogue of Theorem XI of Chapter V due to Reinhardt (1934a), 
and found independently by Mahler (1947c). 

Lemma 1. Let Of be a convex symmetric open 2-dimensional set. Then 

A{^)=\miV{34r), (1) 

ivhere ^ runs through all symmetric circumscribed hexagons and V{^) 
is the area of . 



' We speak of a tac-line in 2-dimensions instead of a tac-plane. 
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Let ^ be any circumscribed hexagon and the critical lattice 

of so that 

by Lemma 13 of Chapter V. But is certainly admissible for JT, 

and so the left-hand side of (1) is at most equal to the right. 

When is a parallelogram, the lemma is trivial, so we suppose JT 
is not a parallelogram. Let M be a critical lattice for JT so that, by 
Theorem XI of Chapter V, it has precisely 6 points ip, iqr, ±r on 
the boundary of Jf, where p, g is a basis and 

p + q + r = 0 . 

Let 3^^ be the hexagon formed by tac-lines at ip, ig, ir to JT, 
taking the corresponding tac-line — rr at p to the tac-h’ne rr taken at p, 
if that is not unique, etc. Then 3^^ is a symmetric hexagon circum- 
scribed to JT. The lattice M is admissible for 3^^ by Theorem XI of 
Chapter V, and so 

A{3f)=d (M) ^ Am = i V(3(^o ) . 

by Lemma 13 of Chapter V. This concludes the proof of Lemma 1. 

IX. 4. 2. The following lemma due to Dowker (1944 a) relates the 
areas of circumscribed polygons to a convex set 3T, which need not be 
symmetric. We sketch the proof, for which see also Fejes Toth (1953 a). 

Lemma 2. Suppose that there exists a circumscribed (« i 1)-go« 
and a circumscribed {n — i)-gon to a convex set Jf. Then there exists 
a circumscribed m-gon with m^n and area 

If 01,02,03 are three points on the boundary of 3^ then in this 
proof we mean by 

®i O2 -<^ O3 

that a^,a^,a^ occur in that order on traversing the boundary of 3f in, 
say, a counter-clockwise direction. 

Let the sides of be the lines Cj, ..., oc„_i. By definition, these 
are tac-Unes to JT. Let (1^?^« — 1) be a point on the boundary 
of JT at which a, is a tac-line. If is a tac-hne at several points, then 
we choose Oy once for all. We may suppose without loss of generality 
that 

«n-l < «1 < ®2 < < “n-^1 < «X • 

Similarly let Py and 6y be defined with respect to +1 , where 




Preparatory lemmas 



237 



We distinguish two cases. Suppose, first, that three of the bj occur 
between two of the Oy, say, 

^ ^ b^^ (t^ , 

where the symbol between and b^ means that possibly ai = 5i, but 

otherwise Oi< bj< 62 . Let have sides aj , P2 > °2 > • • • > ^^d ' have 

sides p,, Ps, ..., P„+i- Then 

^ + Vm , (1) 



as is clear from Fig. 10. Indeed the difference between the two sides 
of (1) is the sum of the areas of the two 4-gons whose sides are formed 
by «i, Pa, Pi, P2 and Oj, Oj, 

Pa, P2 respectively. From (1) 
we have 

min{F(^;), F(^”)} 

which proves the lemma in 
this case. 

The polygons 
may have fewer than n sides, 
since some sides of ^„+i may 
coincide with those of ^n-i- 
But this possibility is covered 
by the enunciation of the 
lemma. We shall not repeat 
this remark which will apply 
at a later stage in this proof 
and also to the proof of 
Lemma 3- 

If the first case does not happen, then, since there are two more 
b’s than a’s we have, on re-indexing if necessary, that 




and clearly 



K(^„_i)-r(^^) = K(pQr). 
V[0^-)-V{^n^r) = V{uvw) 

V{pqr) '^V(uvtc) . 

The point labelled 0| should be labelled at 



«1 ^ *>1 < 62 < «2 ^ ^ < *>s+l < 

for some s. Let have sides 

ai,P2 ,...,Pi,a,,...,a„_i 



and 

Pi, 02, ..., OCj_i, Pspi, ..., P„ + i 

respectively. Then ageiin 

F(^n+i) + F(^„_i) ^ -b F(^„”) , 
the difference being the sum of the areas of the 4-gons Oi Oj pi Pa and 
as-iOsPsP.+i. see Fig. 11. 
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Corollary 1. Let U{n) denote the injimum of the areas of circum- 
scribed m-gons with m^n. Then 



U{n) ^ C/(« - 1) 

and 

2U{n) ^ U{n — 1) + U{n + 1). 



( 2 ) 

(3) 



The first inequality is a trivial consequence of the definition, the 
second follows at once from Lemma 2. 

It is convenient to extend the definition of U(n) to non-integral 

value of the argument. For 
t^} put 

U{t) = (!-/) U{n]-\-lU{n + i), 
if 

t=n + l, 

Corollary 2. Let /Ui, 
be numbers such that 

{\^r^R), 2 /^, = 1 . 

Then 

UiZTrir)^ZTrUiL), (4) 

where are any num- 

bers with 

The inequality follows at once from Corollary 1 if 7? = 2 and then 
follows easily for general R by induction. 

By a similar argument to that used for Lemma 2 Dowker (1944 a) 
proved 

Lemma 3- Suppose that JT is symmetric as well as being convex. Let 
^ 2 n 2n-gon circumscribed to JC. Then there is a symmetric 2m-gon 
with m^n, also circumscribed to Cf of area at most 

Let the sides 




E(i3'„+ ,) + 



of ^ 2 n be tac-lines at 


Cti,. 


* * ) ^2 n 




where 


«i,.. 


• > ®2 o > 




Let 


^2n 


. . . *< a^n < 






P; ~ ^;±ni 




(5) 


where the bar denotes the image 


in the origin. 


Then, by symmetry. 



the Py are the sides of the circumscribed polygon which is the 



Preparatory lemmas 



239 



image of the origin. By the convexity and symmetry we have 



for every j. 



a,+i < a,. < a,.+i < a,- 



If is not already symmetric, we may suppose without loss of 
generality that a„=(=5„ and, by changing the orientation of the indexing 
if need be, that 

Then 

®2n = ^2n ®2n ^ ^2n> 



by (5). There is thus a greatest / in n^j<2n such that 



f>; ^ Oj , 

and for this j clearly 

«;i^< Vl^«7+l- 

It is not excluded that b ,-^2 ^Iso lies between and Without 

loss of generality j = n; and then 



by (5). 



a„^b„<b„+i^ 

®n + l> 



^2n ^ ®2n 



«i ^ bj , 



Let ^ 2 „, 



^ 2 n have sides 






and Pi,...,p„,a„+i,...,a 2 „ 



so ^ 2 „ and ^ 2 „ are symmetric, by (5). Precisely as in the second case 
of the proof of Lemma 2 we have 



V{^L) + Vm ^ V(^2n) + V{^2n) = 2F(^2J i 

and so either or satisfies the requirements of the lemma. 
Corollary. For convex symmetric Jf, 

A{cf)=\U{6), 

where U{6) is the infimum of the areas of circumscribed m-gons with w^ 6 . 
This follows at once from Lemma 1 and Lemma 3 . 

IX.4.3. We shall also need Euler’s formula for convex polyhedra 
in a slightly unusual form (cf. Fejes Toth 1953 a). Let v„ 
be points in the plane (vertices). Let Xj (IgsgS) be curves joining 
one vertex to another vertex or, possibly coming back to the same 
vertex (the edges). The reader may think of the Aj as line-segments or 
composed of a finite number of line-segments. We suppose that no 
point of Aj except its ends is a and that no two Aj cross. Finally we 
suppose that it is possible to get from any one vertex to any other along 
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the Xj. Then the whole plane is dissected by the Xj into a number (p 
of connected pieces (the "faces”) one of which contains all points 
outside a large circle |a;| =R. Then Euler’s formula is 

Lemma 4. 

9? + A'' = S + 2. 

This may be readily verified by induction on S. 

IX. 5. FEJES Toth’s theorem. In this section we prove a result 
due to Fejes Toth (1950a), see also Fejes Toth (1953a). He proves 
something more general and also gives interesting related results but 
we give here only what is needed to treat the lattice constants of 
cylinders. 

Theorem VIII. Let Jf be a convex open polygon with at most 6 sides. 
Let jT be any convex open set and suppose that the sets 

are packed in 3^, i.e. the Jf) are subsets of 3^ and no two have points in 
common. Then 

R U{6) ^ V{J(r) . 

where U(6) is the lower bound of the areas of m-gons circumscribed to 3f 
with m^6. 

The notation U (6) is in conformity with that of Lemma 2, Corollary. 
Fejes Toth’s own version of his proof is very compact, and we have 
found it desirable to expand it. 

IX. 5. 2. The stages in the proof of Theorem VIII are enunciated for 
convenience as propositions. 

Proposition H. Let 3^ be a convex open 2-dimensional polygon and 
let 3^, [i'^r^R) be open convex sets packed in 3^. Then there are open 
convex polygons (l^r^T?) such that 2., contains and 

(i) the 2^ are packed in 3^, 

( ii ) if <3 is a side of 2, then either, 

( iij a is part of the boundary of 3^, 
or 

( ii^) there is a subsegment a' of a containing more than a single point 
which is part of the boundary of a 2^, (s4=r), and 

(Hi) if a is a side of 3ff then some subsegment a' of 3f consisting of 
more than a single point is part of the boundary of some 2,. 

Note that the are not required to be similar to each other. We 
shall give two proofs of proposition 1. The first is by transfinite induc- 

* Mr. H. L. Davies has pointed out that this Proposition is false as it stands by 
giving a counter example. The proof of Theorem VIII can, however, be salvaged. 
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tion (Zorn’s Lemma). It involves the minimum of geometric argument, 
but is non-constructive. The second, which will only be sketched, gives 
a process for constructing the J, in a finite number of steps. 

If {Jf'} and {Jf"} are two packings of R open convex sets in 
we write 

{jT'} < {Jf"} 

if Jf/' contains JT,' for \^r^R, not necessarily strictly. We denote the 
set of all such packings by 77 and verify three statements about the 
symbol <. 

(I) If and {jr"}<{jr'} then = in the 

sense that the sets JT/ and JT/' are identical for \^r^R. This is 
trivial. 

(II) If (Jf'} <{jr"} and {JT"} then {Jf'} < {JT'"}. This 

is again trivial. 

(III) Suppose that 77 is any subset of the set of packings 77 such that 
if {JT'} and {JT"} are in 77 then either {Jf'} < {Jf"} or {Jf"} < {Jf'}. 
Condition (III) states that then there is some packing {JT} in 77 (not 
necessarily in 77), such that {Jf'} <{-^} for all {Jf'} in 77. 

To prove (III) we take for the union of for all {JT} in 77. 
We must verify that {Jf} is a packing of convex open sets, and do this 
for the properties in turn : 

First, is open. For if is a point of then it is a point of Jf,' 
for some packing {JT'} of 77. Since is open, a neighbourhood of 
is in JT/, and hence also in X",, as required. 

Secondly, X], is convex. For let «i ,«2 be any points of X,, say, 
z^^Xy , where {X'}, {X"} are packings of 77. By the hypotheses 
of (III) we may suppose, by interchanging z^^ and z^ if necessary, that 
{X'}-<{X"}. Then z^iX^C^Xy. Since z^eXy, the whole segment 

tz^+[\-t)z^ (O^/^l), 

is in Xy \ and so in X^, as required. 

Thirdly, X^ and X^ have no points in common if r =)= s. For suppose 

jT,, Zq^X^ . Then z„eXy z^eXy for some packings {X'}, {X"} in 77, 
where again without loss of generality {X'} < {X"}. Then Zf^eX^CXy , 
so is common to X," and Xy , contrary to the hypothesis that {X"} 
is a packing. This concludes the verification of (I), (II) and (III). 

We say that a packing {X*‘} is maximal if 
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implies {JT''} By Zorn’s Lemma, since (I), (II), (III) are satis- 

fied, to any packing {Jfj there is at least one maximal packing 
such that 

{Jf} 

But it is easy to see that in a maximal packing (JT'*} the sets Jfj'' must 
be polygons J, which satisfy the conditions (i), (ii) and (iii) of Proposi- 
tion 1. Since this will be clear from the constructive proof which we 
give later, we do not give the detailed argument here. This concludes 
the first proof of Proposition t . 

We now sketch a second, constructive, proof of Proposition 1. The 
fundamental process is this. If JT is any open convex bounded set and 




Fig. 12. consists of together with the ^aded region 



p is any point not in Jf, then the open convex cover of JT and p is the 
least convex set which contains JT and has p as a boundary point : 
that is, JT is the set of 

tp+{i—i)q, 0 ^/<l. 

If p is on the boundary of Jf, then the open convex cover of q and JT 
is just Jf. Otherwise the convex cover has as boundary the two tac- 
hnes from p to together with a portion of the boundary of JT. 

If now Jfi , . . . , Jfji a.re the sets of Proposition 1 , we form the poly- 
gons by successively taking the convex covers of the sets and 
suitably chosen points. Let a be any point on the boundary of 
and a a tac-hne at a. Consider points q on a along one direction, say, 
to the right of a (see Fig. 12). If q^ is to the right of q^, then the open 
convex cover of qfj contains that of q^ and For some q 

to the right of a on a it is possible that the open convex cover of 
and q overlaps some other body of the original packing. Since the 
are open, there is then a p fartherst to the right along a such that 
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the open convex cover of p and contains no points of any (r =f= f )• 

It is possible that p = a. We then get a new packing {JT'} on replacing 
Jfi by the portion of open convex cover of and p which is in ^ 

If the open convex cover of p and does not meet any (f=t=1) 
for all q to the right of a, then JTi is to be the set of points in which 
are in the convex cover of and any point q to the right of a on a. 
Similarly one may consider points to the left of a along a. 

We may repeat the process on the new sets {Jf'} and will indicate 
how after a finite number of steps it must come to an end with polygons 
have the properties (i) , (ii) , (iii) of Proposition t . We denote the 
sets at the /-th stage by {JT'}, so Suppose first that 

there is a pair of indices r, s such that JT/ and JT/ have a boundary 
point o in common. Then are obtained from Jfj, X'J by 

taking a to be a common tac-line (Chapter IV, Lemma 6) to Jf’ , JT/ 
at a and by applying the above process both to and and both 
to right and left along a. Once this has been done for a pair of indices 
r, s at the /-th stage we do not do it again for the same pair of indices at 
a later stage. If there is no pair r, s of indices for which Jf/ have 
a common boundary point and which have not already been treated, 
then there may be a body Jfj with a boundary point a on the boundary 
of If so, we take a to be the side of on which a hes (both sides 
in turn if a is a vertex of J^) and apply the process. Again, once this 
has been done for JT/ and a side of Jif we do not do it again for the 
same r and the same side of Jf’. Neither of the first two steps may be 
allowable. Suppose that one of the JT/ is not a polygon. Then a is 
taken to be any point on the boundary of Jf’ which is not in a line- 
segment forming part of the boundary of nor on the boundary of 
jf/ (s=4=>")- Finally, if all the Jf/ are polygons and the first two stages 
are impossible, then a is taken to be any vertex of a jT/ at which at 
least one of the two sides is not also a tac-hne to some 

It is clear that the steps outhned above will come to an end. And 
the final set of clearly a set of polygons having the properties 
(i), (ii), (iii) of the enunciation. 



IX.5.3. The next stage is an application of Euler's formula (Lem- 
ma 4) to the configuration of Proposition 1 . 

Proposition 2. Let .2, of Proposition! have q, sides (I^r^i?). 
Then‘S' „ 



In the appHcation of Euler’s formula, the faces will be the polygons 
together with Jq, the set of points not in or on the boundary of 3 ^. 

' The reader is reminded that is the set in which the are packed. 

^ The proof assumes facitly that every vertex of .J?” is a vertex of a . 2 ,.. 

16 * 
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Lemma 4 is not immediately applicable, since not every point is in or 
on the boundary of a ,3,. The set of points which do not enjoy this 
property is clearly open and so consists of a finite number , . . . , 
of connected open sets. By (ii) and (iii) of Proposition 1, any one of 
these sets, say, cannot contain the whole of a side ct of a J,. We 
now apply Lemma 4 where the “vertices” are of the following kinds 
(a) the sets jSfj (l^f^L), 

(/S) points not on the boundary of an ^ but on the boundary of 
at least three J, (Q-^r^R), 

(y) vertices of . 

The "edges”, for the purpose of Lemma 4, are the segments of the 
sides of the J,. joining the “vertices”. Then every side of J, gives rise 
to at least 1 but possibly more “edges”. Let q, be the number of “edges” 
surrounding J,, so 

(f) 

Since every “edge” belongs to precisely two J, the number 

of “edges” is 

( 2 ) 

Let 3^ have precisely h sides, so 

(3) 

Every vertex of type (a) or (j3) above belongs to at least three J,, 
and there are at most h vertices of type (y). Vertices of 
type (y) are on the boundary of and at least one M, (r=t=0). Hence 
the total number of “vertices” N satisfies 



Finally, the number of faces <p is 

= i? + 1 . 



From (1), (3), (4) and Euler’s 

q/ ^ ~ S 



(Lemma 4), we get 

2 q',^6R-6 + 2h. 



But clearly q'^'^qa=h, by (1), and so, by (1), (3), 



This concludes the proof of Proposition 2. 



(4) 

(5) 
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IX. 5. 4. The proof of Theorem VIII is now comparatively rapid. 
Let U (i) and , q, have the meanings they had in Lemma 2, Corol- 
laries 1 , 2 and Propositions 1 , 2. Clearly 

V{2,]^V{q,) (\^r^R): 

and so 

since the are packed in 3^. 

Hence by Corollaries 1, 2 to Lemma 2 and by Proposition 2 we have 
R-^ V(Jt) S 2 U(q,) ^ U (7?-i 2 qA ^ 1/(6). 

This is just the assertion of Theorem VIII, and so concludes the 
proof. 

IX. 6. Cylinders. We now make the apphcation of Theorem VIII 
to the lattice constants of cylinders adumbrated in § 1.5- 

Theorem IX. Let be a convex symmetric 2-dimensional star-body 
and ^ the set of points 

Then 

A(<^) =A{X'). 

We may suppose without loss of generaUty that JT, and so is 
open since the presence or absence of boundary points does not affect 
the value of the lattice constants A{^), A(3C). It was shown already 
that 

A{‘S)^A(X') 

whether or not is convex, so it remains only to show that 

d{N)-^A[J(r) (1) 

for any "^-admissible lattice A. 

We prove (1) by computing in two ways the number N =N{X) of 
points of A in a large cube 

1^,1 <X 

In the first place we have the trivial estimate 

^f(A)/V = 2»X*-bO(Z2) (2) 

as X—roo. 

By Theorem III, since A is "^-admissible, it gives a packing of -J "^. 
Let C be the set of N cylinders 

+ ®. 



( 3 ) 
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where 

aeA, max \a,\<X. (4) 

1 

These cyUnders are all contained in the cube 

maxjxyl + (3) 

if is contained in |a;|<i?. We consider only the packing of the 
cyhnders C in (5). 

For |y|<X + i?, let L{y) be the number of cylinders of C which 
meet the plane Xg = y, that is the number of a€ A satisfying (4) for which 

K-y| <i- 

These L{y) cyhnders give rise to a packing in the square 

\xf\<X + R (/ = f,2) 

of L{y) sets similar and similarly situated to Hence 

L(y)U'{6)<4{X + RY (6) 

by Theorem VIII, where U'{6) is infimum of the areas of circumscribed 
w-gons to ^ Jf with m^6. 

But clearly 

/ L{y) dy =N 

-X-R 



from the definition of Z.(y). Hence 

U'{6)N<8{X + RV. (7) 

by (6). 

Since R and U'{6) are independent of X, the comparison of (t) and 
(7) as X-> oo gives 

d{A)-^U'{6). 

But 

u'{6) = 4 A{\yT) =zi(jr) 

by Lemma 3, Corollary. This completes the proof of (1), and so of the 
theorem. 



IX. 7. Packing of spheres. The unit sphere 

|a:|<1 



in n dimensions has volume 



tJ" 



r 1 + 



V„ = V(^„) =■ 



( 1 ) 
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where ^he usual gamma function. In this section we estimate 

the lattice constant 

( 2 ) 

and are primarily interested in the behaviour of F„ when n is large. 

In the literature it is customary to use defined as the lower bound 
of the constants y'„ such that every positive definite quadratic form 
2 /• 7 ''"i in ^ variables represents a number |det(/,,.)|V- (,.Her- 
mite’s Constant”). By the arguments of Chapter I, § 3 we have 

= ( 3 ) 

We shall need to know the asymptotic behaviour of the volume V„. 
From Stirling’s formula* we have 

\im = 2ne, (4) 

n-*-oo 

where 

e = Z (»'!)■*• 

From Minkowski’s convex body Theorem and the Minkowski- 
Hlawka Theorem we have 

{2C{n)]-^V„^r„^2-”V„. ( 5 ) 

where C (n) is Riemann’s function. These inequalities lead by (3) and 



(4) to 


lim sup ny„ 2ne 


(6) 


and 


n— ► ro 

lim inf ny~^~^ \n.e. 


( 7 ) 



n— *• oo 



Of course the factor 2C(n) in (5) has no effect in (6) and might as well 
have been replaced by 1. Indeed none of the improvements of the 
Minkowski-Hlawka Theorem discussed in Chapter VI affect the constant 
on the right-hand side of (6). On the other hand, Blichfeldt (1929 a) 
has improved (7) to 

\\mmlny~^^ne, ( 8 ) 

which appears to be the best asymptotic form to date^. The argument 
is a purely packing one and makes no use of the fact that only lattice 
packings are relevant to (8). Blichfeldt’s results have been improved 
by Rankin (1947a), and yet further, by a more perspicuous argument, 

* See any reputable text book on analysis, for example Whitt.\ker and 
Watson (1902a) Chapter XII. 

^ The improvement in ( 8 ) announced by Chabauty (1952a) is not correct, 
see the review by Rankin in Maths. Reviews 14, 541. 
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by Rogers (i958c). Their methods yield considerable improvements 
for small values of n, but do not improve the constant in (8). 

Blichfeldt’s methods may be apphed to other sets than spheres, 
see Rankin (1949a, b,c) and 1955 a and the literature cited there. 

There is a detailed discussion of the non-lattice packings of 3 -dimen- 
sional spheres in Fejes Toth (1953a), see also S. Melmore ( 1947 a). 

I have been helped by my recollection of a seminar talk by Professor 
Rankin in Cambridge in the late 1940s on Blichfeldt’s method. 

IX. 7. 2. We observe first that Blichfeldt’s Theorem I of Chapter III 
may be generahzed to packings and indeed takes a quite simple shape. 
Let be any bounded M-dimensional set and suppose that the sets 

= (Igr^R) (1) 

are packed in some set Then trivially 

V{^)^RV{6^). (2) 

Suppose now that there is some function <p (x) of the vector variable 
X such that 

(i) 9)(®)=0 if 1*1^ e for some q 

and 

(ii) y>{x) = J^<p{x — X,) for aU x, 

whenever (1) is a packing of S^. 

Let ^(q) be the set of points at a distance from including 
the points of ^ itself. Then, in the first place, 

f f{x) dx^V[^{Q)\ {dx = dxi...dx„) ( 3 ) 

by (ii) and, on the other hand, by (i) 
f y>{x) dx = X I ~ Xf) dx = R f q>{x) dx = RV((p) (say), (4) 

rie) r 

since all points with ^{x — x,)^0 lie in ^(q). The comparison of (3) 
and (4) gives 

R^v{r(Q)]!V(<p). (5) 

Of course the characteristic function of which is 1 on .5'’ and 0 
elsewhere, has the properties (i) and (ii). With this as the function g>, 
the inequality (5) is rather weaker than (2), because we have replaced 
V{J^) by V{S~{Q)y. though of course this can be avoided by a refinement 
of the argument. Blichfeldt observed that there are sometimes 
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functions <p which give a better estimate than the characteristic 
function. 

For example, if y = is the sphere of unit radius, the necessary 
and sufficient condition that open spheres of radius 1 and centres Xj , Xj 
shall not overlap is jx^— Xj| ^ 2. The following lemma may be regarded 
heuristically as showing that, in a packing of spheres, a point may be 
on the boundary of two spheres but cannot be too near the boundaries 
of more than two spheres simultaneously. 

Lemma 4. Put 

9 p(x) = max{o, 1 — ^ |x|“j. (6) 

Suppose that x, are any points such that 

\x, — x,\>2 (\^r<s^R). (7) 

Then 

2?>(x-x,)^l (8) 

for all points x. 

We may clearly suppose without loss of generahty that 
0<cp{x — xf) = 1 — ^|x — x,|^ 

for \ ^r^R. 

If yi, ••., y/e and y are any real numbers, we have 

^ 2 (y - JrY = 2 (yr - + (^ y - 2 yr)* ^ 2 (yr - y,Y- 

Hence, on applying this to the individual co-ordinates, since 1®P = 
^ 1 H h , we have 

R Z lx-x,l^>Zl^r~Xsl^^2R(R-i), 

ISr&R r<s 

by (2). But this is just the same as (8). 

From this we have almost immediately 

Theorem X. Let x^ be points in the n-dimensional sphere 

1*1 <^, ( 9 ) 

and let 

lx, — Xj 1^2 (1^r<s^7?). 

Then 

i? ^ 2-"/“ -f (Z + 2‘)". 

If (p (x) is as in Lemma 4, we have 

V[(p) = J (p{x)dx = - 2|a;|2jrfa! = 2"/*(l V„, 

|3c1'<2 



( 10 ) 
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where V„ is the volume of the unit sphere. The result now follows from 
(5), since ^(p) is now the sphere 



x\<X + 2K 



which has volume 
Corollary 1 . 

I a; I < 1 satisfy 



(Z + 2J)"F„. 

The lattice constant F„ and volume of the unit sphere 



If A is admissible for |a;|<l, then the points of 2 A satisfy the 
conditions of the theorem. The number of points of 2 A in 

\x\<X 

is 

{<f(2A)}-*F(J^) +0{X’-i) =2-''(^f(A)}-iX’’K. + 0(Z”-‘). 

On comparing with the theorem and letting X-^oo, we obtain the 
required inequality. 

Corollary 2. 

lim inf ny~^^ ne, 

where 

This follows from Corollary 1 and (4) of § 7-t. 

IX. 8. The product of n linear forms. Denote by the w-dimen- 
sional set 

and let 



The set plays an important part in algebraic number theory (see 
Chapter X), but the only precise values of v„ known are 

vl = vl = 7 



given by Chapter II, Theorem IV and Chapter X, Theorem V respec- 
tively. Here we shall be concerned with estimates for v„ when n is large 
rather as in § 7- For information about what is known for « = 4 or 5 
see Chapter II, § 6.4- 

In Chapter III, § 5-3 we already gave Minkowski’s estimate 






which by Stirling’s Formula, gives 



lim inf ^ = 2 • 71 828 . . . . 

n— >00 
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Blichkeldt has given an elegant proof that 

lim infv„ ^ (2:7r)Je3 = 5 . 30653 (1) 

n-*oo 

and this we obtain in this section. 

The estimate (1) is not the best yet found. Rogers (1950a) has 
shown indeed that 

lim inf ^ 4n~^e^ = 5 ■ 70626 .... 

«— VOO 

His intricate and laborious proof may be regarded as an elaboration of 
Blichfeldt’s. 

Since ^ has infinite volume, there is no estimate of d(^) above 
from the Minkowski-Hlawka Theorem. Indeed, work of Scholz (1938 a) 
on the discriminants of algebraic number fields gives some reason to 
suspect that lim supv„ = oo. 

n— >00 

In § 8.2 and 8.3 we give two lemmas and then in § 8.4 Blichfeldt’s 
proof of (1). 

IX.8.2. The following Lemma of Schur (1918 a) also occurs in the 
theory of the “transfinite diameter” in analysis. 

Lemma 6. Let |j, be real numbers. Then 

n (1) 

i<j Vi/ 

where 

^^ = |w(w -l)}-i’”(’”-i>- 12- 2^- ... • w”. (2) 

The continuous function /J (|j— |,)^ of the m variables attains its 
•<; 

maximum, say, on say at = (l^f^w). Then, by 

homogeneity, 

(3) 

V i / i<i 

for all 1^, with equality when (1^) = (?;,). The derivative of the logarithm 
of the left-hand side of (3) with respect to each variable must vanish 
at the maximum (f,) = (rji ) ; and so 

= (l^f^m), (4) 

since 2 = T Let 

f(v) = n > 5 ) 

i 

be a polynomial in the variable rj. Then (4) is 
f"iVi) _ m (m- i)T ]i 

2/'(y,) ” 2 



( 6 ) 
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The polynomial 

Hv) =/"(»?) -w(w - \)r]f'(r{) 

is of degree at most m — \, since the coefficients of rf* vanishes. By 
( 5 ) and (6) we have h {rj^ = 0 ( 1 ^ ^ »t) ; and so A (?;) vanishes identically : 

f"[rj) —m(m — i)rjf'(rj) + (m — i) f (rj) =0. (7) 



The diflerential equation (7) determines f{rj) completely in terms of, say, 
/(O) and /'(O). Hence we may determine the symmetric functions 
and JJ (rji— rjj)^ in terms of /(O) and /'(O). Since =1 and the coef- 
ficient of 7]” in f{rj) is 1, this determines f{rj) completely, and so also 

( 8 ) 



It is simpler, however, to use a more indirect approach which will now 
be described. 

The resultant of two polynomials, say, 

= i7 (»? -a;). y>iv) n {il -^i) 

with highest coefficient t is defined to be 



R{<p>w) =n 

»,7 

= n v(“.) 

i 

=={-iY^n<p{fii) 

i 

= 9)- 

If 

w(»?) = n ir)-yk) 

is a third polynomial with highest coefficient t, and if 

"W =^w{'n) +x‘^n)9{n) 



(9x) 

(9.) 

(9,) 

(%) 



identically for some number A and polynomial x(v), then 

Ji((p,co) =A^R(<p,f). ( 10 ) 

by (92). 

In particular, if /(rj) is defined by (5), we have 

^ = n (Vi - ViY = (- TJ f'iVi) 



( 11 ) 
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where 

kWl 

has highest coefficient 1. More generally, put 

so that f^(r]) has highest coefficient!. Then on differentiating (7) k 
times one readily obtains 

+ w(w -!) /*(»?) =0. (!2) 

Hence 

(13) 

on using the rules of operation (94) and (10). But f„{rj)—\ and 
/m_i(?;) =»? +y for some number y (in fact y = 0); so 

(14) 

by (Oj). The required value (2) fort? follows now from (11), (13) and (14). 

IX.8.3. We also require an estimate of the number occurring in 
the last lemma. 

Lemma 7- If 

G„ = \-2^ m", 

then 

lim sup {w* log G„ — I log m} ^ ^ . 

m — ^00 

Put 

g(A;) =;Klog;r (:t>0). 

Then 

g'{x) =log* + 1 

increases with x ; and so 

g(* + <) +f(*-0^2g(%) (1) 

for any t, since if f>0 we have 

g(^ + f) -g(a;) =/g'(fi) 

g{x)-g{x-t) =fg'(fa), 

where it<x<^j, so g'da) <^'(fi)- 
In particular, 

f g(x)dx = J {g{l + t) +g{l-t)]dt^g(l), 
i-h 0 



( 2 ) 
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for any integer 1. Thus 

logG,„= 2 giPl^ j g{x)dx=\{m-\-\)no^[m + \) + 

.3 

where y is independent of m. The required estimate now follows at once. 

Corollary. If = is the number defined in 

Lemma 6, then 

lim sup {w'* log <?„ + i log w} ^ 

W-^00 

This is immediate. It is not difficult to see that ‘‘lim sup” may be 
replaced by “lim”, but we do not need this. 

IX.8.4. We can now prove Blichfeldt’s Theorem on the product 
of linear forms discussed in § 8.1. 

Theorem XI. Let v” be the lattice constant of the set 

jV: \x-^...x„\<\. 

Then 

lim inf ^ (27r)l«l. (1) 



Let A be a lattice which is admissible for JT, and let m be an integer 
whose value will be settled later. 

Consider the sphere 

|®| < Q, 

where q is chosen so that 

V(S>) =md(A)] 

that is 

e”V„ = md{A), (2) 

where V„ is the volume of |a;|<l. By Blichfeldt’s Theorem I of 
Chapter III, there are m points in ^ whose differences 

ajj— Xy all lie in A. Put 



and write 
Then 
and so 



Xi = {xu....,x„i) {i^i^m), 

2 S* = 2 \Xi\^^mq^\ 



n 

l^k^n 




( 3 ) 



by the inequality of the arithmetic and geometric means. 
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Now let 

Pk = n {Xki-Xkj)^- 

lgt<;gw 

Then, on the one hand 

(4) 

by Lemma 6 , where §„ is the number defined there. On the other hand, 

n Pk= n 

where 

fix) =x^...x^. 

The points x^ — Xj belong to A, which is .T^admissible ; and so 
Hence 

n Pk^^- ( 5 ) 

On eliminating from ( 3 ), (4) and (5) we get 

(6) 

Hence, on eliminating q between ( 2 ) and ( 6 ), 

{d{^)y'’'^XlX2X3, (7) 

where 

X2 = niV^I\ 

and 

X3 = 

Now Xi is independent of n and 

liminf;t;i^ei, ( 8 ) 

by Lemma 7 Corollary. Further, X 2 is independent of m, and 

lim^X2=(^^^y (9) 

by (4) of § 7.1. Finally, 

lim;i'3 = l (10) 

if, say, m=n-*oo. 

Since v„ is the infimum of over ,T^admissible lattices, and 

since the product of the right-hand sides of ( 8 ), ( 9 ) and ( 10 ) is the right- 
hand side of ( 1 ), this proves the theorem, by ( 7 ). 
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Chapter X 

Automorphs 

X.l. Introduction. A homogeneous linear transformation to is said 
to be an automorph of a point set .5^ if .5^ is just the set of points tax, 
xe£^. The automorphs of a set ^ evidently form a group. Many of 
the point sets of interest in the geometry of numbers, or which occur 
naturally in problems arising in other branches of number-theory, have 
a rich group of automorphs which is reflected in the set of .5^-admissible 
lattices. Already in the work in which he introduced the notion of 
limit of a sequence of lattices, Mahler (1946d, e) laid the foundations 
for future work and indicated some fundamental theorems. Since then 
much has been done but some challenging and natural questions remain 
unanswered. 

Mahler (1946d, e) considers star-bodies with groups of automor- 
phisms having special properties which he calls automorphic star-bodies. 
In this account we prefer in each case to state the properties of the 
group of automorphs which are required to hold. 

We shall say that a homogeneous linear transformation to is an 
automorph of a lattice A if u) A = A, that is if A is precisely the set of 
taa, aeA. This is really a special case of the definition at the beginning 
of the chapter since A is a point set. Since 

d{taA) = |det{w)| d{A). 

we must have 

det (to) = zh t • 

We say that to is an automorph of a function f{x) of the vector x if 

f(tax) =f(x), 

for all X. In particular, to is an automorph of the distance-function 
F{x) if and only if it is an automorph of the star-body 

F{x)<i, 

since and F(x) determine each other uniquely. Clearly 

F(toA) =F(A) 

for a lattice A if to is an automorph of the distance-function F(x), since 

F(A) =miF(a), 

o6A 



by definition. 
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If 9’ is any point set and t a non-singular homogeneous linear 
transformation, then 

Zllxy”) =|det(x)|Zl(^^), 



since a lattice A is admissible for ^ if and only if xA is admissible 
for x^. 

In this chapter we shall make great use of the properties of homo- 
geneous hnear transformation expounded in Chapter V, § 2. In par- 
ticular we write 



if 



<p=p-i-o, 

tpx = px -|- O®, 4»X=p(ox), 



respectively for all x. 

X.1.2. We first give three theorems which are already in Mahler 
(1946a, b) but not all as formulated here. We give also the proofs: 
their brevity shows the power of Mahler’s techniques, particularly in 
the striking Theorem III. 

Theorem I. Let F(x) be a distance-function with an automorpk a> 
such that 

det (u>) 4= ± 1 . 



Then F(A) =0 for all lattices A. 

By taking instead of oj if need be, we may suppose that 

|det(u>)| < 1 . 

If there is a lattice A with F(A)d=0, then there is a critical lattice M 
for F{x)<\, by Theorem VI of Chapter V. But then 

F(wM) =F(M) =1, 

and 

(i{u>M) = ldet(u))| £^(M) 

in contradiction to the definition of a critical lattice. 

For example. Theorem I shows that 

< 1 

is of infinite type since it has the automorphs x^-^ix of de- 

terminant 2. This was our example of a star-body of infinite type in 
§ 5 of Chapter V. 

Theorem II. Let F(x) be a distance-f unction. Suppose that every 
point Xq with F(x^) =1 is of the shape 



Xq — Oq , 



Cassels, Geometry of Numbers 
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where to is an automorph of F, and is in a compact set Then for 

every lattice A with F(A) = 1 there exists a lattice M with 

F{M)=\, d(A)=d{M) 

having a point in 

Since F{x) is continuous, the set of points c€'^ with F{c) =i is 
compact if is compact. Since c„ as defined in (1) has F(Cg) =F{x„) = 1 , 
we may suppose without loss of generality that 

F{c)=i (C6'g’). (2) 

Since f(A) =1, there is a sequence of points a^^A, not necessarily 
distinct, such that 

F{a,)-^\ (r~^oo). 

Then 6,= {F(a,)}“^a, satisfies F{bf) = 1 ; and so 



for some automorph to, of F and some Since is compact, we 

may suppose, after extracting a subsequence and re-indexing, that 



Let 



{r-^oo). 

A = to. A,. 



Then, since |det(to,)| =1 by Theorem I, we have 

F(A,) = F(A) - 1 , d (A,) = d (A) 

and 

F(a,) c,6A,. 

By Theorem IV, Corollary of Chapter V, the sequence A, contains a 
convergent subsequence, and so, without loss of generality, 

A,^M 

for some lattice M. Then 

i(M) = lim d{A,) =i(A) 

r-^oo 

and 

F(M) ^ lim supF(A,) = F(A) = 1 (3) 

f -^00 



by Theorem II of Chapter V. Further, M contains 

e' = lim F(a,) c,, 

r-*oo 

SO 

F(M) ^ F(c') = 1 , (4) 

by (2). From (3) and (4) we have F(M) =1. This concludes the proof. 
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Corollary. There is a critical lattice for F(x)<i having a point c 
in ^ with F{c) =1. 

For if A is critical so is M. This corollary is in contrast with the 
example given in § 5-2 of Chapter V of a star-body no critical lattice 
of which has points on the boundary. Note that the corollary does 
not affirm that every critical lattice of i*'(a5) < 1 has points on F(£c) = 1 ; 
the author [Cassels (1948a)] has given a rather artificial* counter- 
example of a body F(»)<1 satisfying the hypotheses of Theorem II 
and having critical lattices with no point on F(ac) =1. 

As an example of Theorem II consider the body jF: \xiX 2 X^\<\ 
with its distance-function IajYjYjIC Here may be taken to be the 
single point c = (1,1,1); since every point *o= (aiq, Ajo, ^ 3 o) with 
\xio^ 2 o^ 3 o\ = I is of the shape 

aC(, = u) c , 

where cu is the automorph 

of jF. Hence there are critical lattices for ^with a point at (1, 1, 1). 
If one is concerned only with the evaluation of A[jF) and not with the 
enumeration of all the critical lattices, it is enough to consider critical 
lattices with a point at (1,1,1). 

Theorem HI. Let the point-set ^ be a subset of the star-body SF 
with oo. Suppose that for every r there is an automorph o), of .9'’ 

such that contains every point of Lf which is in |a;|<r. Then 

4l(^) 

Clearly 

By Theorem I we have det (u>,) = i 1 , and so 

A{^) =zJ(w,^)^zJ(^,), 
where is the set of points of 6T in |*|<?-. But 

lim A {6^,) =A(£^) 

f— voo 

by Theorem V of Chapter V, so A{^)=A{£F), as asserted. 

Clearly one may formulate theorems similar to but more general 
than Theorem III by making use of the full force of Theorems II and V 
of Chapter V. The argument used in the proof of Theorem HI was 
already used in the proof of Theorem XV of Chapter V. 

* As Professor Rogers remarks, it is quite likely that the 3 -dimensional body 
|.T]| max (Xj, X 3 ) < 1 furnishes a natural example. 
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As an example of Theorem III one may take for ^ ^ respectively 
the sets 

. I Xj X3 j <C t 

and 

S'-. \xiX^X 3 \<i, |x2|<e, |a;3|<£, 

where s is any fixed positive number. Then the automorphism to, may 
be taken to be 

Xi = r-^s^Xi, X^ = re-^Xi, X^^re'^x^, 

where X = io,x. In this example one may deduce that a lattice A 
with d{A)<A {S) has infinitely many points in y. For A must have 
a point in S' for every e>0. If A has no point a=t=o with a2 = ^3 = 0. 
this imphes that A has infinitely many points in S ; and on the other 
hand, if a = {Ui, 0, 0) is in A, then all the points ma (m = \, 2, ...) are 
in A, so there are still infinitely many points of A in S'. Indeed the 
argument shows that for any £>0 there are infinitely many points 
of A in This sort of argument was already used for Lemma 12 of 
Chapter V about the existence of infinitely many points in —1 <x^x^<k. 
There we could prove rather more since this set was shown to be bound- 
edly reducible. In § 7 we shall make a systematic study of when there 
are infinitely many points of a lattice in a star-body following Daven- 
port and Rogers (1950a). 

X.1.3. The point sets with a large group of automorphisms with 
which we shall be concerned will be mainly constructed simply from an 
algebraic form <p [x) . For example cp (x) may be x^ x^ , x^ x^ x^ , % (x| -)- x|) 
or xf-j-x^ — Xj, and the set S may be defined by 

i?>(®)|<'l (f) 

or 

0 ^ «??(*) <1 ( 2 ) 

or 

0<9)(a:)<l (3) 

or 

— k<(p{x) < 1 , (4) 

where k and I are positive numbers. Of course (2) and (3) are not star- 
bodies. Apart from sets especially constructed from sets of the type 
(1) — (4) to act as counter-examples, other sets with large groups of 
automorphisms have proved intractable. For example the lattice 

constant of 

|xi| max(x2, xl) < 1 

is not known, though it would be of some interest in the theory of 
simultaneous approximation and the problem has had considerable 
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attention [see Davenport (1952a) and Cassels (1955a) and the refer- 
ences given there]. 

We shall make continual use in this chapter of the results of Chap- 
ter I, § 4 about the relationship of lattices to forms. 

A particular kind of lattice plays a special role in connection with 
sets of the type (1) — (4), where <p{x) is an algebraic form. It is useful 
to introduce some new terminology. If (p{a) is an integer for all aeA 
we say that (p is integral on A. If, further, 93(0) =0 for o6 A only when 
a—o, we will say that <p is non-null on A (the trivial zero at o being 
disregarded). Finally, if there is some number i^=0 such that t<p is 
integral on A we say that 93 is proportional to integral on A. Then 93 
is integral on |f|^^”A, where m is the degree of 93. 

In many, if not all, cases where the form 93 has infinitely many 
automorphs and the critical lattices A^ for one of the sets (1) — (4) are 
known, it turns out that <p is proportional to integral on A^. Indeed 
in some cases 93 is proportional to integral on every known admissible 
lattice, and it is suspected, but not proved, that no other admissible 
lattices exist. In other cases, there certainly do exist admissible lattices 
on which 93 is not proportional to integral, but the critical lattices are 
not amongst them. 

Before discussing the general properties of a lattice A on which a 
form^ 93 is proportional to integral and illustrating it with concrete 
examples, it is convenient to prove a simple lemma. 

Lemma 1. Let r>0 and m>0 be integers and let 

y(«i. •••.",) 

he arbitrarily given numbers for integers u^ in 

(l^p^r). (5) 

Then there is a uniquely defined polynomial f [u) of degree m in the variables 
such that 

f[u)=y(u) (6) 

for all integers tt = («<^, . . . , «,) in (5). 

This is certainly true when r = \. For r>1 we use induction on r. 
We may write 

/(tt) = 2 <?^(M],---,m,-i). ( 7) 

where the are polynomials to be determined. For any fixed values 

of «i, the equations (6) determine uniquely the values that 

must be taken by g^(«i, ..., «,_i) in (5); and then there are uniquely 

* We recollect that the word "form” implies homogeneity. 
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determined polynomials taking these values, since we assume that the 
lemma has already been proved with r—i for r. Alternatively one 
could observe that the determinant of the (w + 1)'' equations for the 
(m + 1)' coefficients in /(u) have determinant 

77 

0^u<v^m 

Corollary. If they (u^, m,) are rational, so are the coefficients in f. 

This follows at once from the proof. 

Now let 9 ? be a form which is integral on the lattice A with basis 
6j, ..., b„. Put 

/(u) = /(Ml , . . . , M„) = 9? Uj bj '^ . (8) 

By Lemma 1 , Corollary, the coefficients in the form /(u) are rational. 
Conversely if the coefficients in / (u) are rational, then q> is proportional 
to integral on A. 

We shall now describe in some detail what happens in some special 
cases which have been extensively investigated. 

Suppose, for example, that 

9 ? {x) — x\ + x\ — x\, 

so that f{u) in (8) is any indefinite ternary quadratic form of signature 
(2, 1) (cf. § 4 of Chapter I). No-one has yet been able to construct a 
ternary quadratic form which can be shown not to take arbitrarily 
small values for integral u. apart from the multiples of forms with 
integral coefficients. Oppenheim (1953 b, c) has shown^ that an in- 
definite quadratic form which takes arbitrarily small values of one sign 
also takes arbitrarily small values of the other. Such a form then takes 
values in every interval, since 

f{ru) =r^f{u) 

and /(u) may be taken arbitrarily small of either sign. 

The situation is much the same when 

(p(x) =XiX^X3. 

Then the function /(u) given by (8) is the product of three real linear 
forms: 

f{u) =n (bj 1 Ml + 6; 2 «2 + bj 3 M3) ; ( 9 ) 

* He also shows that if an indefinite quadratic form is not a multiple of a 
form with integral coefficients and takes the value O then it also takes arbitrarily 
small non-zero values for integer values of the variables if the number of variables 
is greater than 5. 




Introduction 



263 



and conversely every product of three linear forms (9) with 

det(6. *) 4=0 

gives rise in this way to a lattice A. A classical theorem which we 
shall prove in § 4 states that if the coefficients in /(u) are rational and 
/(u) may be expressed as the product of three real linear forms and if, 
further, / (u) 4= 0 for integral u =|= o, then 

f{u)=tn ifin Ml + «2 + Pi3 M 3 ) . 

1S;S3 

where Pn, ^ 12,^13 are numbers in a totally real cubic field Sj and 
is the conjugate of in the conjugate field Sy. 

On the other hand, there are certainly lattices A which are admis- 
sible for 

|XiXj| < 1 

and on which is not proportional to integral. This follows at once 
from the theory of continued fractions: alternatively it is not difficult 
to modify the proof of Theorem VIII of Chapter VI. 

A rather more interesting case is 

(p{x) =x^[x\-\-x\). (10) 

Since 

(p (x) = Xj (Xj -h 1 Xg) (Xg - i Xg) , 



where — there is a connection with the cubic fields that are not 
totally real, similar to that of %XgXg with totally real fields: it is clas- 
sical, and will be proved in § 4.4 that if Xj [x\ -j- xl) is proportional to 
integral and non-zero on A, then A arises from a cubic field. But there 
certainly are other admissible lattices for 

\Xi[x\+x\)\<\. (11) 

Let T be any transformation X = xx of the special type 



where 



II 


Xl ] 


^2 = 


^2 3^3 1 


^3 = 


"^^3 2 ^2 “t” "^3 3 ^3 » ) 


Til =)= 0 


^22 ^33 ^23 ^32 ^ 



Then there are clearly constants C, c depending on t, such that 



00 > C ^ 



|y(Tag)| 

!?>(*) I 



^c>0 
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for all X. Hence if A is admissible for | 9 s(®)|<l, so is fxA for some 
number t; and in general (p{x) will not be proportional to integral on 
tA if it is on A. 

This does not exhaust the admissible lattices for (x\ + *^|) < 1 • One way to 
show this is to use the arithmetic-geometric mean inequality in the shape 

klW + ^DI >2\Xj^X^X3\. 

Hence any lattice admissible for | ;Tj Xj | < i is also admissible for | (xj -f x|) | < 1 ; 
for example 2~^M has this property if x^x^x^ is integral and non-null on M (i.e. 
when M arises from a totally real cubic field) ; and it is easy to see that x-^ (;r| -|- Xj) 
cannot be proportional to integral on M. [In fact the ;Ti- co-ordinates of M for 
a lattice on which x^x^x^ or x^{xl-\- xl) is non-null and proportional to integral 
determine the relevant cubic field completely and it cannot be both totally real 
and not totally real.] More generally, one can construct admissible lattices by the 
methods of Chapter VI, Theorem VIII, compare Cassels (1955b) for a closely 
related problem. 

It is an interesting problem to decide for any given form (p{x) if 
there exist admissible lattices for a set | 9 p(a!)|<l on which q>(x) is not 
proportional to integral. Cassels and Swinnerton-Dyer (1955 a) have 
considered the special cases q> (x) = Xj Xj Xg and xl + xl — xl, but they only 
transform the problems into another one. For another line of attack, 
see Rogers (1953 b). It is reasonable to think that essentially new ideas 
will be required even to cope with or Xi-f — 

X.1.4. An important part in the theory is played by so-called isola- 
tion theorems. Their importance was first apparently recognised by 
Davenport and Rogers (1950a) though there are foreshadowings in 
Mahler (1946e) and indeed in Remak (1925a). A new type of isolation 
theorem is proved and exploited in Cassels and Swinnerton-Dyer 
(1955 a). 

The phenomenon of isolation takes various forms all of which state, 
roughly speaking, that lattices in the neighbourhood of a given lattice 
M, with certain exceptions, are much worse behaved than M itself. 
Thus one result we shall prove is that if Xj^x^x^ is integral and non-null 
on a lattice M, then to every e>0 there is a neighbourhood S of M 
in the sense of § 3.2 of Chapter V, depending on e. such that 

inf IxiXjXal < e 

aceA 

for all AeS except the A of the shape /M, for a number t. This is a 
particularly sweeping result. Perhaps more typical is the isolation 
theorem for Xi[x\-\-x\}. This states that if 

inl\xy_{xl + xl)\ =1, 

*6 M 

X 4: O 
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and if is proportional to integral on M, then there exists 

an r]Q>0 and a neighbourhood 2 of M, such that 

inf|xi(xl + x|)l <t 
acg A 
+ o 

for all Ae2 except those of the type xM, where x is of the special type 
with Ti 2 = Ti 3 = T2 i = T 3 i = 0 already discussed in §1.3- Note that for 
Xj Xj X3 , the number e could be chosen at will, whereas for Xj (x| + x|) 
both tjq and 2 are fixed by the lattice M. 

All isolation theorems have the same general type of proof. In the 
first place, it is shown that if the form q){x) is, say, integral or integral 
and non-null on a lattice M, then q)(x) and M have a group S2 m of 
automorphs cj in common; that is 

(p{iiix) = (p[x) , w M = M . 

For the special forms XjXj, X1X2X3 and Xi(x|-)-x|) these automorphs 
are given by the theory of units in algebraic number fields, and for 
Xi-|-x| — x| by the theory of indefinite ternary quadratic forms; but 
we shall, in fact, find it easy to handle the group without these 
theories and using only Mahler’s compactness theorem^ A lattice A 
near M, in the sense of Mahler, is one of the shape 

A =xM, 

where x is near the identity transformation. Suppose that there is an 
aj€ M such that <p{a^ takes some interesting value a. Then 

93 (w Op) = 9? (Oq) = a , a>g£2M- 

Then A contains the point xwOp. Although |xap— ao| is small when x 
is near the identity, it does not follow that |x<uao— uiOol uniformly 
small for all to, since in general co may be chosen so that loOp is arbi- 
trarily large. By suitable choice of co in £2^ one may then show the 
existence of a point xtoop in A=xM having the properties desired in 
the problem in question, unless the transformation x satisfies certain 
conditions. Sometimes one must start not with one point Op, but with 
several, Oj , . . . , a,, , so as to eliminate x of different kinds. This general 
attack will be clearer from the examples in § 5- Isolation theorems may 
be used to discuss the existence of infinitely many lattice points in 
regions, as will be shown, following Davenport and Rogers (1 950 a) , in § 7. 

X.1.5. Before going on to the main subject matter of the chapter 
we shall discuss in § 2 certain special forms and their groups of auto- 
morphs. In § 3 we shall then discuss a method of Mordell which shows 

^ One of Minkowski’s first applications of the geometry of numbers was in 
fact to the theory of units in algebraic fields. 
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how a bound for the lattice-constant of an «-dimensional body may 
be obtained from a bound for that of a related (n — l)-dimensional body. 
When the original n-dimensional body is of a special type having many 
automorphs, Mordell showed the argument can be carried a stage fur- 
ther. In particular it gives the lattice constants of the 3-dimensional 
sets \x-^x^x^\<\ and \x^{x\ + x\)\<\. In §8 we discuss briefly the 
relevance of continued fractions to forms and bodies with automorphs 
and the possibility of generalisation. 

X.2. Special forms. We discuss first the automorphs of the form 
?(X) = { U { n + xUs+k)]' n=r + 2s, (1) 

where both the possibilities r=0 and s = 0 are permitted. We may 



write 






where 


<p{x) =f(z) = 77 z,, 

Ig/gn 


(2) 




x,=Xj 1 






Xf+s + k — X,+ k **r+5+* J J 


(3) 


and — 1. 


If the X, are all real, then Zj is real for 1 ^j'^r 


and j 



and are conjugate complex numbers for f and conversely, 

if the are of this shape then the x, are real. Let now u) 

be a real automorph of 95 ( 0 :). In the obvious way it gives rise to an 
automorph u) of y>{z). Let Z = Hz. Then 

n Xi = [j Zi ( 4 ) 

identically in z^, z„, where the Z, are linear forms in z„. The 

only possibility is that Zi = AiZi where L=L(l) is a permutation of 
1 , ... ,n and , . . . , are real or complex numbers. For our purposes, 
it is enough to consider the automorphs 

Zi = XiZi (1^/^m), 

where 

lil£n 

by (4). But the transformation to transforms the real point a; into the 
real point X = uix. Hence Zj,...,Z, are real and are 

conjugate complex, and so 

A, = real (1 ^ ^ r) 

conjugate complex (t^A^s). 




(5) 

( 6 ) 



( 7 ) 
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Conversely, if the numbers satisfy ( 6 ) and (7), then (5) defines a real 
automorph to of 99 (aj). 

We shall also need the transformation u>* polar to u>, that is the 
transformation such that identically 



when 

Now 



1^/gn 

X=^uyx, F = w*y. 

I I 



where Zi is given by ( 3 ) and 



Wj = y,- (1 g ^ r) 

2^r+k=yr+k-iyr+s + k ' 
^‘^r+s+k=y,+ k + iyr+s+k . 



(i^k£s). 



Hence if w* induces the transformation Z>* in the ip-co-ordinates, we 
must have 



w, = kr^wi, 



where FF = to* t». In particular, the transformation to* is also an auto- 
morph of (p(x). 

X.2.2. We shall require also to know something of the automorphs 
of the form 

^{X)=X\^ h^r-^»^r+l Xl, (1) 

where possibly r — n, so there are no negative terms. For completeness 
we prove the well-known 

Lemma 2. If (p{x) is defined by (1) and x„ is any point, then there 
is an automorph to of (p(x) such that, for some number t, 

to Xo = (C 0, . . . , 0) 
or 

toxo = ( 0 , ...,0, t) 
or 

toXo= (t,0, ...,0, t) 

according as (p (®o) > 0, 99 (Xq) <0 or (p (Xj) = 0. 

This is certainly true for n = 2, since then there are the well-known 
automorphs X = <ax given by 

Zj = cos + *2 sin j? , Zj = — ;Ti sin -\-x^ cos ^ (2) 
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for any real ^ when r=n= 2 , and by 

Xi + Xi = k{xi + Xi), Xj - Zj = (Xi - Atj) (3) 

when r = l, n =2 and k may take any values except ^=0. 

Next, the lemma is true when r = n. For we may suppose it proved 
for M — 1 . There is then an automorph fc>i acting only on the first « — 1 
co-ordinates such that 

= (m, 0, . . . , 0 , X„q) 

for some u. Then an automorph <«>2 acting only on the first and last 
co-ordinates makes 

u>2 Xi = {t, 0 , , 0) 

for some t. Then u) = u>2 does what is required. 

Finally, the lemma is true in general. For we may find in succession 
automorphs Uj , a>2 , a>3 such that for some numbers u, v we have 

= coi = (m, 0, . . . , 0, 0. • • • . ^no) . 

a!2 = ojjaci = (m, 0 , . . . , 0 , 0 , . . . , 0, r) ; 

and then 

X 3 = 0)30:2 = (<, 0 , . . . , 0 ) or ( 0 , . . . , 0 , t) or (/, 0 , . . . , 0, t) ■ 

Corollary. If o) is the automorph constructed above, then the polar 
u)* is also an automorph. 

It is readily verified that the polars of the special transformation (2) 
and (3) are automorphs of cp(x). The required result now follows by 
induction. 

[It IS in fact true that if t>> is any automorph of 95 [x) then its polar is also an 
automorph. This is most easily proved using matrix theory. Let cd for the nonce 
denote the matrix whose elements are the coefficients in the transformation ci> 
and let e be the matrix with 1 in the first r places on the diagonal, — 1 on the 
remaining diagonal places, and 0 elsewhere. The fact that u> is an automorph is 
expressed by 

w'eu) = 6, (4) 

where the dash (') denotes the transposed. On taking the reciprocal of (4) we 
obtain 

cj-i (5) 

But the polar w* of to is clearly = and so w* is an automorph of <p by 
(5), since 6^*=C.] 

X.3. A method of Mordell. In this section we discuss a method of 
Mordell for estimating the lattice constant of an «-dimensional set 
by reducing the problem to an (n — 1)-dimensional one. 

Let ^ be any «-dimensional set, not for the moment necessarily 
endowed with any automorphs and A a lattice. In Chapter I, § 5 it 
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was shown that if b is any point of the polar lattice A*, then there 
are m — 1 linearly independent points of A on the plane 

TTb : xb =0 

(scalar product). The plane 715 cuts in an (m — 1)-dimensional set 
In an obvious sense, there is an (tt — 1)-dimensional lattice Aj, 
consisting of the points of A in rr^ . Hence, if we can show that there is a 
point other than o of the (« — 1) -dimensional lattice A5 in then there 
is certainly a point other than o of A in 6^. If i„=(=0, for example, one 
could project on to the hyperplane x„ = 0 and use Lemma 6 Corollary 
of Chapter I. For this procedure to be effective, the vector bgA* must 
be chosen so as to give a good {n — 1) -dimensional problem in TTfo; and 
so in general we have replaced one «-dimensional problem by another, 
rather vaguer, one for the polar lattice, together with an (m — 1)- 
dimensional problem. 

In this shape the technique has been applied by Mullender (1950a) 
and Davenport (1952a) to the enigmatic 3-dimensional starbody 

I Xj I max (x| , X3) < 1 . 

Making use of the known (cf. § 3.3) lattice constant of the set 

\Xi\ {x\+x\) < 1 , 

they select a point b of A* for which Ji(f>2+ ^3) is small and then treat 
the 2-dimensional problem in tti,. 

Mordell (1942a, 1943 a, 1944b) observed that it is sometimes 
possible to make the w-dimensional problem for the polar lattice the 
same as the original problem; and then the M-dimensional problem is 
reduced entirely to one or more (» — l)-dimensional problems without 
the need to solve an M-dimensional auxihary problem. The sets ^ for 
which this procedure is feasible are those with a large group of auto- 
morphs, so it is appropriate to discuss them in this chapter. From one 
point of view it may be regarded as based on a generalization to non- 
convex bodies of the results in Chapter VIII about polar convex bodies. 



X.3.2. We first consider quadratic forms, for which Oppenheim 
(1946a) has given a neat treatment following Mordell (1944b). 

Theorem IV. Let j = 2J(.^ s) be the lattice constant of the (r + s)- 
dimensional star-body 






\x\ + 



X, - - 



< 1 



for r^O, s^O. Then 



min (/;!+:., 



(f) 

( 2 ) 



where the first or second term is omitted if r = 0 or s = 0 respectively. 
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Write 

(p (ac) =(p^^^{x) ^ x\-\ h - ^f+i 

and 

W\iN =inf |?'(a)l 

a t A 



for any lattice A. Then, by homogeneity, 



r,s 



sup 

A 



(|y|(A)T 

d^A) 



+ s 



xUs. ■ ( 3 ) 

( 4 ) 

( 5 ) 



over all lattices A, with the natural convention that if (A) =0 for 
all A, then j= oo; as most probably happens when t'>0, s> 0, 
r+s^5 (see appendix A). 

We show first that 

{l9^1(A)y+-i^r2{|95|(A*)}<f^(A), (6) 

where A* is the polar lattice of A and 

C = min (7) 

It is enough to show that 

{bl(A)y+-i^C-^|9.(b)|<f^(A), (8) 



where b is any primitive point of A*. After Lemma 2 we may suppose 
that 6 is one of the points 

bi = (/,0,...,0), b^={0....,0,t), ba= (t,0,...,0,<), (9) 



where b^,b^, bg occur only if r > 0, s > 0 and both r > 0, s > 0, respectively. 
Consider first b = bj , where 

(p(bj}=fi. (10) 

By the results of § 5 of Chapter 1 there is a basis Oj , . . . , a„ for A such 
that 

bj Oj = 1 , bj a, = 0 (2 ^ ^ : 

so that ai=(r^, a() and a, = (0, a') for /4=f. where a- is an (« — 1)- 
dimensional vector. Hence the points of A in x^ = 0 form an (« — 1)- 
dimensional lattice M in the space with co-ordinates x„ with 

basis a' (2^/^«). Further, 

d{A) = |det(Oi, ...,a„)| = |ri| |det(a 2 , ...,a(,)| = |<|“ii(M). (11) 

But now by (5) with r — 1, s for r, s we have 

{Wr-i.s\ (M)y+-i^/;it<f^(M) 1 9>(bi)l^MA) (12) 
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by (10) and (11). This proves (8) in the case 5 = 6j since the left-hand 
side of (12) is not less than {|9p| (A)}’^+®“^. The proof of (8) in the second 
case, when 6 = 62 in (9), is similar except that the roles of r and s are 
interchanged. 

It remains to consider the case 

6 = b3=(i,o,...,O,0, 
which occurs only when r>0, s>0, so 

q>(b) =0. 

There then exist a basis Cj, . . ., a„ of A such that 
bOj = 0 (2 ^ ^ m) . 

Introduce new co-ordinates x'j by 

x[ = x^ + x„, x'„ = x^-x„, x'j = Xj (/4=1 ,m), 

so that 

^{x) = x[xl + x'2^ h x’r^ - x',h 

and the points Oj , . . . , o„ lie on x[ = 0. The points of A on = 0 form 

an (n — 1) -dimensional lattice M, and (p{x) with =0 depends only on 
the « — 2 variables X2 , ..., Hence |9)(®)| takes arbitrarily small 

values on M; for example, by the degenerate case of Minkowski’s 
convex body Theorem, there are points of M other than o with 

x[ = 0 , |At)|<e (2g/^n — 1), 

where e>0 is arbitrarily small, since this set has infinite (w — 1)- 
dimensional volume. Hence (8) holds also when 5 = 63; and so generally. 
This concludes the proof of (6). 

We may also apply (6) to the lattice A* with its determinant 
d (A*) = d~^ (A) and its polar lattice A : 

{l9,|(A*)}'+-i^C-=*{|,p|(A))i-MA). (6') 

On eliminating |99|(A*) between (6) and (6'), we obtain 

{kl (A)f (A) }*<'+*-**>. 

This imphes the required result (2) on using (5) and (7). 

In general there is no reason to expect there to be equality in (2), 
but this sometimes happens, as in the following 

Corollary. 

r — A r — 3 

■'4,0 — 2* ■‘2, 2 — 2- 
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By Theorems III and VII of Chapter II, we have 
■^,0 = 2 ^ 1 = 2 = (!)*• 

Hence the theorem shows that q and j have at least the values 
specified. The forms 

2 {{^2 + ^3 + T (^2 + + (^3 “ 1 “ + ^ 4 } 

and 

x\- xl — xl — xl + + ;r4a:2 + X4 3;3+ 2x1X3+ 23^1 a;2 

have signature (4, 0), (2, 2), and determinants j, f respectively and do 
not represent members less than t in absolute value for integer value 
of the variables not all 0, as is easily verified. This proves the corollary 
on making use of the relationship between forms and lattices of Chapter I, 
§ 4 (especially Lemma 4). 

Again, as Mordell observed. Theorem IV gives # once # is 
known. Again, the method of proof of Theorem IV gives the lattice 
constant [;^, see Oppenheim (1953b)] of 

0 Xi + x| X3 — .^4 1 » 

once that [|, see Davenport (1949a)] of 

0 < Xi + x| — x| < 1 

is known. These sets are not star-bodies. It is necessary to choose the 
point 5 of A* so that b\-\-b\ — b\ — b\ is numerically small and negative. 
It is possible to use Mordell’s method to obtain information about the 
critical lattices when there is equality in (2). We do not do this here 
since we shall do something similar for products of linear forms in § 3-3- 

X.3.3. Before applying Mordell’s method to ternary cubics we 
must translate Theorem VIII of Chapter II out of the language of forms 
into that of lattices. 



Lemma 3. The lattice-constant of the 2-dimensional set 



: Wix)\<U 


(1) 


where 




f{x) =XiX 2 (Xi + X 2 ), 


(2) 


is A(.^) =7^. There are precisely two critical lattices, Mj and M 2 . 
lattices have only 0 in common. 


These 


Let ■&3 be the roots of 




^3 ^2 _ 2 ^ _ 1 = 0 


(3) 



in some order. Then the lattice M(i?i,?? 2 . ^ 3 ) basis Oj, 02 defined by 
7Jaj = (^ 2 -^ 3 . ^ 3 -^ 1 ), 7»«2 = R(^2-<?3).^2(^3-^l)} (4) 
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IS one of the two critical lattices. If is a permutation of , f?2 > ^3 > 

then M (i?i , ??2 > >^3) = (^1 ' ^2 > ^3) *7 *7 permutation is an 

even one. 

The geometrical purport of the lemma becomes clearer if new co-ordinates 
yj are introduced by the equations 

^1=^1. ^ 2 = - ya. 

so 

1 V3 

- ^1- ^2= - y n ~-^y2- 

In yj • ya co-ordinates, the region .T has three asymptotes at an angle of 2nj} 
and IS carried into itself by either a rotation through 2n/3 round the origin or by a 
reflection in an asymptote. The two critical lattices given by the lemma are then 
each invariant under a rotation through 2n/3 and each is carried into the other 
by a reflection in an asymptote. The reader may find it instructive to draw a 
figure of the critical lattices each with 6 pairs of points on the boundary. For a 
treatment of sets which have similar symmetry and convexity properties to 
^ by the geometrical methods of Chapter III see Bambah (1951a). 

In what follows we do not introduce y, , yj as above but we do maintain the 
essential cyclic symmetry between Xj, and — Xj— x^. 

We note that the roots of (3) are 

0j = 2cos^^, 02 = 2 cos^^, 03 = 2 cos (5) 

so that '^3 are a permutation of 0j, 02, 03- We have the trivial 

identities 

02=05-2, 03 = 05 - 2 , 01=05- 2, 01 = 1 -02-01 etc. (6) 

The value of A{.T) follows at once from Theorem VIII of Chapter II, 
so it remains only to verify the statement about the critical lattices. 
By Theorem VIII of Chapter II, if M is critical there is certainly a basis 
Oi , 02 of M such that 

VK«1 + «2«2) = -/o («!,%), (7) 

where 

/o {^1 1 *^2) ~ Ui u\u^ — 2 Uiu\ U2', ( 8 ) 

for one may interchange the two elements of the base given by Theo- 
rem VIII of Chapter II or take —a* for (k = i,2). Let 

‘^2k) {k=\,2), ( 9 ) 

and define numbers 113* by 

<^1* + ^2* + «3)i = 0 {k=\,2). (10) 

Then (7) becomes 

J7 Ki“i + «;2«2) = [ 7K + 0;«2)- (11) 

1<;^3 i^7^3 

Cassels, Geometry of Numbers 
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Hence 

= 0='1,2,3). (12) 



where ■&i, ’^2, #3 is some permutation of ©i, 02, ©3. From (10) and (12) 
we have 



^;+2 
— ■^;+2, 



(/ = 1,2,3), 



(13) 



where i?4=^i, A is some number. By (11) we have 



n «; l = 1 , 

i 

and so in fact 

A*=(^l-^2)(i?2-^3)(1?3-1?l) (14) 

= ±7, (15) 



where the value ±7 may either be checked directly from (5) or from 
the fact that the square of the right-hand side of (14) is the discriminant 
of the cubic /o(«i, by definition (§ 5-1 of Chapter II). We note that 
i?i,i?2,^3 determine and Oj absolutely uniquely, by (14). 

But now we have the identity 



f^{w + v,v) =fo{—v,w). 



Hence if the point of M {&i , § 2 , ^3) is defined by 



01 + 02-1-03 = 0, 

we have 

y> («i Og + «2 O3) = - /o ( ,'h , M2) : 

and so a^, must correspond to a permutation , i?2, i!^3 of 

and it cannot be the identical permutation by the last sentence of the 

previous paragraph. Hence the cyclic change of bases of M (t?i, #2, ^3) : 

(Oj , O2) — >• (O2 , O3) — >• (O3 , Oj) — ^ (Oj , O2) — >• 

must correspond to a cyclic permutation of ?>i, i?2, Hence there are 
at most two distinct lattices M (^1, ^2, ^3), lor the permutations ^3 

of ©1, ©2, ©3. 

It remains to show that M (l?i , ^2 , ^3) is distinct from M (©1 , ©2 , ©3) 
if ^3 is an odd permutation of ©j, ©2, ©3. We may suppose now, 

without loss of generality, that 

i?l=©2, #2 = ©l, ^3 = 03 - 

From ( 4 ), (6), (I3) and ( 15 ), a point 5 of M(©i,©2,©3) has 
74,. = P(©;) (/ = 1,2,3), 



( 16 ) 
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where P(t) is a polynomial in the variable t with (rational) integer 
coefficients. We may suppose, by (3), that P{t) is of degree ^2; and 
then it is completely determined by any one of bi,b^,b^. If 6 is also in 
M(02,6 >i, 03), then it is also of the shape 

7»6i = (?( 02), 7% = Q{e,). 743 = ^(03), 

for some polynomial Q{t) of degree ^2 with integer coefficients. But 
now P{0^ =<?(®3). ^nd so the polynomials P{t) and Q{t) are identical. 
Hence 

P(03) = P(0i); (17) 

and so 

since P(0y) (/ = 1,2, 3) are conjugates^. Finally, 

bi = bi^= b-^ b^ 

by (16) and (18), and so 

— ^2 ~ 0 ' 

That is, o the only point common to M (0j, 02, 0s) and M (02, 0i, 03), 
as required. 

X.3.4. We now apply Mordell’s method to prove results for 
and Xi{xl-\-xl). These are equivalent to weaker forms of Theo- 
rems X and XI of Chapter 2, where the relevant literature is cited. 
We shall later prove something rather stronger by the use of isolation, 
but will not prove the full force of Theorem X of Chapter 2 in this book. 
The methods extend to products of n real or complex forms in n dimen- 
sions in a way which will be obvious, but do not then give the exact 
lattice constants [Mordell (1941a) and (1943a)]. 

Theorem V. A. The lattice constant of the 3-dimensional set 

(I) 

is A (AQ = 7. Denote by Nj the lattice with basis 

6i=(1,1,1), b2=(l?i,i?2.^3), &3=(^?.^L^^). (2) 

where ^3 ''°ots of 

( 3 ) 

in some order. All the critical lattices h of which have a point a for 
which 

(4) 

^Alternatively, (17) means that P(&\ — 2) = P(0f)] and so the polynomial 
P{fi— 2) — P{t) is divisible bv — 2( — 1 . One may now put t = and obtain 

P(©2) = P(©3). 



18 * 
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are of the shape 

A=u)Ni, (5) 

where ta is an auiotnorph of . 

B. The lattice constant of 

|A:i|(;i;i + 4)<'l (6) 

is =j{ 2 })i. Denote by Nj the lattice with basis 

where = — 1 and is the real, and are the complex roots of 

+ ( 8 ) 



Every critical lattice A for which possesses a point a with 

\‘h{ 4 + 4 )\ ='• 

A = <u Nj, 



is of the shape 



(9) 

( 10 ) 



where to is an automorph of ^ . 

We first prove Theorem V. A. The lattice Nj given by the theorem 
is certainly ^-admissible, since a point a of Nj has co-ordinates 

«, = «1 + W2«^/+«3’^7^ (/ =1,2,3), (11) 

where are integers. Then a^a^a^ is a rational integer by its 

symmetry in If a^a2O^ — 0 , then one of the a^ is 0, say 

Mi + «2^i + M3^J = 0; and this is impossible unless Mj = M2 = «3=0, since 
does not satisfy any equation of degree less than 3. Further, 

(Ni) = I det [\ , b2 . 63) I = I (^1 - ^3) (^3 - ^3) (^3 - ^1) I = 7 , (12) 



as was verified already in the proof of Lemma 3. The lattices obtained 
by different permutations of #1 , i?2 > ^3 (2) differ from each other 

by an automorph of namely a permutation of the co-ordinate axes. 
Write 



and, as before. 



(p[x) =*1X2X3 



kl (A) =inf|(7>(a)l. 

a t A 
4=0 



We show first that, for any lattice A, 



{|9p|(A)}^^7-Hkl(A*)}<f='(A), 



(13) 
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where A* is the polar lattice of A. The proof follows closely the pattern 
of the proof of Theorem IV. It is enough to show that 

{l9.|(A)}‘“g7-M<p(b)|<^»(A), (14) 



where b is any primitive point of A*. 

Suppose, first, that 95(b) =0. Then, after applying a suitable auto- 
morph of furnished by § 2.1, we may suppose without loss of 
generality^ that 

b = (1,0,0) (15) 



or 



ft =(1,1,0). 



( 16 ) 



In the first case, (15), the plane 

bac = 0, 



which must contain two hnearly independent elements of A is just the 
plane % = 0, and all points on it satisfy <p{x) =0. Hence (14) certainly 
holds in this case. In the second case, (16), there are two hnearly inde- 
pendent points of A on 

*1 T 2^2 = 0. (17) 



For these points 



(p [x) — Xi X 2 x^ — ^2 *3 , 



(18) 



and the 2-dimensional set |5t25r3|<e is of infinite type for any e>0. 

Hence there are certainly points aeA other than o with |95(a)|<e. 

This proves (14) in the case b is given by (16). 

There remains the case when 95 (b) 4= 0 and so, after the application 
of a suitable automorph, we may suppose that 

b = {tj,i), t>0, (19) 

and so 

95(b) =f 3 . (20) 



We have supposed that b is primitive, and so, by Lemma 6, Corollary 
of Chapter 1 , the 2-dimensional set of points [x ^ , x^ such that 

(aJi, X2, — x^ — 

is a lattice M of determinant 



<f(M) =fi(A). 

But now 

inf^l 01(22(01 -faj)| ^{ 7 “i(^(M)} 5 = 7 ~*^*<f*(A), 

’ 4=0 

'For we may suppose that 6i=t= 0, 6j= 0. One gets the shape (15) or (16) 
according as 6j= 0 or 0. 
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by Lemma 3, the exponent f being correct for reasons of homogeneity. 
A fortiori 

This proves (14) when 5 is given by (19) and (20) ; and so completes the 
proof of (13) and (14). 

On interchanging A and A* in (13) and using d(A*) =d~^{A), we 
have 

{kl(A*)}*^7-^kl(A)}<f-3(A). (13') 

On ehminating |9)|(A*) from (13) and (13') we obtain 

|9,|(A)^7-^i(A), (21) 

so A ^ 7, since is the set of x with 1 95 (x) | < 1 ; and then A {J7[) = 7 
since we have already exhibited an admissible lattice Ni, with ^f(Ni) = 7- 
It remains to consider the critical lattices A^ with a point on the 
boundary, and we may suppose, after the use of a suitable automorph, 
that 

(1,1,1)€A, <f(A,)=7. (22) 

Clearly then the 2-dimensional lattices considered above will turn out 
to be critical for the relevant 2-dimensional sets, and it is necessary 
only to check that this can happen only when A^ = Ni for a suitable 
choice of i?j,i?2,i?3, where Nj is defined in Theorem V. A. 

We note first that 

k|(A.*)=7-=‘ (23) 

by (13) and (13')- Hence the lattice of points 

(x ^ , Xj) with (xi , X2 , — .Xi — Xj) e A* , (24) 

which has determinant 

i(M;)=i(Af) = 7-L 

must be one of the two critical lattices for 

|xiX2(xi-f Xj)! < 7'* (25) 

given by Lemma 3. But we have already seen that Ni for any choice 
of ’&3 is critical, and so the lattice 

defined by putting Nj = Nj (??, , i?2 . ^3) for \ (24), is also critical. 

Clearly, by the proof of Lemma 3, both critical lattices of (25) occur as 
M( (j?i, i?2. for suitable choice of . Hence we may suppose 

without loss of generality, that 
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that is, the polar lattices A* and Nf are identical at least on the plane 

+ 2^2 + ^3 = 0 . 

Let now b = {b^,b^, b^) be any point of Af (and so of N*) with 

I f>2 ^3 1 ~ 7 + ^2 "h ^3 ~ 0 • (2d) 

Then the lattices Aj*, N{* consisting of the points of A^ and of Nj re- 
spectively in the plane 

b\ Xj -{- ^2 ^2 T ^3 ^3 ” 0 (27) 

must both be critical, in the obvious sense, for the 2-dimensional section 
of |%iA;2ir3|<t by the hyperplane (27). By Lemma 3, there are only 
two critical lattices and these have only the origin in common. Hence 
Af and Af must be identical, since (1,1,1) belong to both lattices, by 
(27). Thus A^ and Aj coincide on any hyperplane (27) such that the 
point 5 satisfies (26). 

But now Nf has a basis b*,b*,b* (say) such that b=b*,b* 
satisfies (26), for we have only to choose a suitable basis b*, b* for the 
section of Nf by Xi~\- x^ + x^~0 and extend it to a basis for Nf. Let 
f>i, f>2- 63 be the polar basis for Nj. Then, on putting 6=6f, b* in 
(25) in turn, we see that A^ contains all points a of Nj such that either 

6 * a = 0 or 6f a = 0 ; 

that is all points of Nj of the shape either 

u^b^ + u^b^ or vib^ + v^b^, 

where , «2 . . ^3 ^^e any integers. Hence A^ must contain each point 

Uibi + u^b2 + Usbs = (u^b^ + u^bs) + (u^bi + Ob^) 

of Nj. Since i(Nj) =d(Aj; we then have Aj=Ni, as required. 

This completes the proof of Theorem V. A. That of Theorem V. B 
is similar except that Theorems VII and VII A of Chapter III are used 
instead of Lemma}. The details may be left to the reader. 

X.4. Existence of automorphs. In this section we prove the exist- 
ence of common automorphs of a lattice A and a form (p(x) which is 
integral and non-null on A, and make deductions about the possible 
such A in a special case. 

We .shall require a quantitative form of Mahler’s compactness 
criterion. Theorem IV of Chapter 5. 

Lemma 4. There is a number 



A/j = Ag («, zlj , X, e) 



( 1 ) 
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depending only on the integer n>0 and the numbers A{>0, x>0, e>0 
with the following property: amongst any N„ lattices A^- in 

n-dimensional space such that 

( 2 ) 

and 

(3) 

there is at least one pair, say Aj, A^, such that 

A2=tAi (4) 

and the linear transformation t satisfies 

|lT-i||<e ||t-1-i||<£, (5) 

where i is the identity transformation. 

We recollect that 

lA|=mf|a|, (6) 

4=0 

and that the symbol ||o|| for a linear transformation X=ax with 
is ||o||=«max|a, 4 |. 

It would be possible to modify the proof of Theorem IV given in 
Chapter V but it is simpler to follow the alternative proof sketched in 
§ 2.2 of Chapter VIII. We suppose we have Ng lattices A,, where N„ 
will be determined later. By Lemma } of Chapter VIII there is a 
Ag>0 and a K depending only on and n, such that any A, satisfying 
(2) and (3) has 

d{A^)^Ag>0 (7) 

and has n linearly independent points in the sphere 

\x\^K. 

By Lemma 8 of Chapter V, there is then a basis 

^ii> ■>Kj 

of Ay with 

\bij\^nK (i^i^n, i^f^Ng). (8) 

Let t)>0 be arbitrarily small, to be chosen later. Then, by (8), if Ng 
is greater than an depending only on n,ij,Ag,K, that is on n,t}, 
Ai, X, there are two Ay say and A 2 , such that 

\bii-bi2\<r] (i^i^n). 

Since the 5,-i are linearly independent, we have 

n 

bii-bii = Z Oij b,i 

;=i 



( 9 ) 
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for some numbers ff,y. But now on solving for the a^j from (7), (8) 
and (9), we have 

^ (1 ^ ^ w, 1 ^ w), 

where is a number depending only on A and w; a crude estimate 
being 

obtained by estimating the elements of the matrix reciprocal to the 
matrix with columns b,i Hence 

if rj chosen to satisfy nOf,rj<e. Hence t = i+o has tAi = Aj and 
||t— i||<£. Since Ai=t'*A 2 we have also i||<£, because (9) is 

symmetric in Aj, A^. This concludes the proof. 

X.4.2. We shall also require the following rather trivial lemma which 
says, roughly, that a form (p [x) cannot be integral on too many essen- 
tially distinct lattices. 

Lemma 5. Let q>{x) be a form integral on a lattice A. Then there is 
an T]>0 depending only on cp[x) and A with the following property: If 
<p{x) is integral on t A and 

then T is an automorph of <p{x). 

Let (p {x) be of degree m and let 6j , . . . , 6„ be a basis for A. If x 
satisfies (t) with sufficiently small rj, we have 

|9p(x2Myby)-95(Z",b;)|<l (2) 

for all integers My such that 

O^My^W (1 ^/^m). (3) 

Then (2) implies 

9?(T2«yby) =<p(Myby) (4) 

' i ' 

for the integers (3), since both sides of (4) are integers. By Lemma 1, 
it follows that (4) holds for all real numbers My. Since every x is of 
the shape 2 «y by with real My , we have <p{'zx) =ip (x) for all x, as required. 
Corollary. Suppose, further, that (p{x) is non-null on A and that 

d{A)^Ai 

for some A^. Then rj may he chosen depending only on <p and dj, but not 
otherwise on A. 

For then 



199 (a)! (asA, a=f=o); 
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and so 

I A| ^ c> 0 

for some c depending only on 9?. Hence, as in the proof of Lemma 4 , 
there is a basis b^, of A with 

I bj\ ^nK ^n) 

for a K depending only on zJj and c, i.e. on and (p. Hence all the 
points Zujbj subject to (3) lie in a sphere 

\x\^n^mK. (5) 

Then (2) holds for small enough rj depending only on <p and K, since 
<p(x) is uniformly continuous in ( 5 ). Hence the corollary follows. 

X. 4 . 3 . We are now in a position to prove the main theorem on the 
existence of automorphs. 

Theorem VI. Let the form (p{x) he integral and non-null on the 
lattice A and let a be any automorph of cp{x). Suppose e>0 is given 
arbitrarily small. Then there is an automorph t of <p(x) with 

l|T-i|i<e, (1) 

such that 

to = a~'*ra'’ (2) 



is an automorph of A for certain integers u, v with 

O^udv. ( 3 ) 



It is not excluded, of course, that to may be the identical trans- 
formation. 



We have 



ki (A) = mf^k(a)| ^t, 

=t=o 



by hypothesis, and so 
for all integers u. Hence 



kl(o“A)^1 

o“A|^00 



for all u and some constant c>0. Further, 



( 4 ) 

( 5 ) 

( 6 ) 



d\a'‘^) =d{N) 



( 7 ) 



for all u since det (a) = ± f by Theorem I. By (6) and ( 7 ) we may apply 
Lemma 3 to the o“A [\^u^N), if N is some large enough number, 
to obtain two lattices a” A and o' A such that 



o”A=to''A {u<v) 



( 8 ) 
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and 

||t-i||<£, 1|t'1-i||<£. (9) 



We may suppose, by choosing a smaller number instead of the ori- 
ginal £ if necessary, that E<rj, where r] is the number in Lemma 5, 
Corollary with Ai = d(/\). We may then apply Lemma 5, Corollary with 
o"A instead of A and deduce from (8), (9) that t is an automorph for 
95(05). Hence u> defined in (2) has all the properties required. 

Theorem VI becomes false if the condition that 95(05) be non-null 
on A is omitted, as is shown by the 2-dimensional example where A = Aq 
is the lattice of integral vectors, 95 (05) = Xg . and o is the automorph 
%i-^2Xi, Xj. But in more dimensions it is sometimes possible to 

use the idea behind Theorem VI to construct automorphs of A even 
when 95(05) may be null on A, for example, by restricting attention to 
automorphs leaving fixed an element or elements of A or of the polar 
lattice A*. 



X.4.4. Theorem VI takes a particularly simple shape when 



9 >(®) =\ n x,\ \ n 0 



'r-\-k 



+ 4 



+ S + *. 






( 1 ) 



where «=r + 2s, which is substantially equivalent to, but rather 
stronger than, Dirichlet’s theorem on the existence of units in an 
algebraic number field. We write as usual 

Zj = Xj (t ^ ^ r) 









= x,^„ T- i X, 
Xr + k ~ i X. 



r-\-si-k 



f-f S + ft 



(I ^ ^ ^ s) • 



( 2 ) 



It is convenient to work with the rather than the x^, so we shall speak 
of the Zj as the appropriate complex co-ordinates. We shall also say 
for brevity that a set of numbers A,- (1 g/^«) is compatible with 95(05) if 

kj = real (1 g ^ n) 

conjugate complex (1 ^ ^ ^ s) . 



Theorem VII. Let 95(05) be given by (1), and let A, (l^/^«) be 
numbers compatible with 95(05) such that 






Suppose that 95(05) is integral on A and that e>0 is given arbitrarily 
small. Then there are numbers o)j compatible with 95(05) and an integer 
m > 0 such that 



n 00 j = 1 




< e 



(1 ^ / ^ «) , 



(3) 
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and such that the transformation u given in the appropriate complex co- 
ordinates by 

is an automorph of A. 

The automorphs of q>(x) were discussed in § 2.1. From what is said 
there it is clear that if Z—iz is an automorph of cp given in the ap- 
propriate complex co-ordinates and if 

ilT-l||<M. (4) 

where n is the dimension, then t must be of the shape 

= {i£f^n): (5) 

that is, there can be no permutation of the forms on the right-hand side : 
indeed, if x is written as 2 r:,h^k> the inequality (4) implies 

k 

|T,y— 1|<1, so Tyy4=0 (1 ^ ^ , 

and the only automorphs of this kind are (5). If Z=\z is given in 
complex co-ordinates by 

it follows now that X and t commute. Hence applying Theorem VI 
with o=X we have 

w =X-“tX'' = X’”t, 

where m=v — u. Then o) does what is required. 

We shall later require to know slightly more about the automorphs 
u> of lattices on which f(x) given by (1) is integral; and it is convenient 
to prove it here. 

Lemma 6. Let <p{x} given by (1) be integral on A and let the automorph 
Z = it) z of A be given in the appropriate complex co-ordinates by 

Zj = o)jZj (1^/^n). 

Then the o>j are algebraic units, that is they satisfy an equation of the type 

f((Oj) =0, 

where 

f{t) = c^t^ + f ± 1 = 0 (6) 

for some m and Cj , . . . , are rational integers. 

Let bj , . . . , 5„ be a basis for A, so that 

u>b, = 2 



(7) 
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for some integers nij^. Since the oj by are a basis, we have 



Let 



det(wyj) = ±1. 

= {i£k^n) 



( 8 ) 



in the appropriate complex co-ordinates and let B be the matrix of 
which the rows are given by (8). Then (7) takes the shape 

B(t)=mB, ( 9 ) 



where m is the matrix with elements and u is the diagonal matrix 
with elements Wi, ...,cu„ on the diagonal. Hence 

to =B 'imB, 



and (Oi, ...,co„ all satisfy the equation / (coy) = 0, where 

/ {t) = det {t i — m) , 

which is of the form (6). 

The two following corollaries are immediate 

Corollary 1 . cji, ...,co„ satisfy the same equation of type (6) with 
m=n. 

Corollary 2. If coy is rational, then o)j—±\. 

Although we do not need it later it is interesting to note that Theorem VII 
and Lemma 6 rapidly gives a complete characterisation of the lattices A on which 
ip(x) is proportional to integral and non-null, at least when r>0. We only sketch 
the proof, for details see Bachmann (1923 a) Kap. 12. 

Lemma 7. All the lattices A on which cp(x) is proportional to integral may be 
obtained in the following way. Let Sj. .... be a set of conjugate algebraic fields 
of degree n over the field of rational numbers, where S'!, .... S', a,re real and S,.).*, 
are conjugate complex Let yjj, ...,yi„ be linearly independent 

elements o/ Sj over the rationale and let yik (i'^l^n) be the conjugate of yj j in .S; . 
Let M be the lattice with basis 

(yik- ■■■■ynk) 

in the appropriate complex co-ordinates. Then a necessary and sufficient condition 
that (p(x) be proportional to integral and non-null on a lattice A is that A be of the 
shape 

A = frM 

where t is real, T is an automorph of (p{x), and M is of the type just described. 

When r > 0, the proof is shorter than the enunciation. By applying Theorem VI I 
with 

Ai=2"-h Xj=\ (2^/^m), 
we deduce the existence of an automorph cj of cp(x) and A with 
0)i>l, |c0;i<1 {2^j^n). 



(10) 
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Since Wj, co„ all satisfy the same equation of degree «, they must all by (10) 
be precisely of degree n and so conjugates. Let 6j, be a basis for A and 

use the notation (7), (8). Then it follows from (9) that 

(ftl Pin) 

is an eigenvector belonging to toy of the matrix m. But clearly m has a set of con- 
jugate eigenvectors 

(Vii yin) 

in the fields ffy generated by toy; and if these are identified with those of the enuncia- 
tion it is easy to see that the lattice M has the required properties. 

When r=0, the position is more difficult since it may be impossible to achieve 
that the toy are all of degree n, though it is possible to make them all of degree 
\n. Let o=(aj, ...,aj and 6 = (/ii, be two linearly independent vectors 

of A in the appropriate complex co-ordinate system. Then gi(ua-\-vb) is a poly- 
nomial in the variables u and v with coefficients proportional to integers, and it 
vanishes for integers u and v only when u = v = 0. Hence is an algebraic 
number. Similarly, if c=(yj, ...,y^)&A is linearly independent from a and b, 
then the ratios aj/y, , Ppyi are of degree n as is also (/>«!+? ft )/yi for any integers 
p and q. It is not then difficult to deduce that aj/ft is in a field of degree n depending 
only on A and not on the choice of a and 6; and the rest follows with some little 
trouble. We do not go into details as we do not use the result. 

X.5. Isolation theorems. As was stated in § 1 there is a wide 
variety of isolation theorems, and it hardly seems worth while to for- 
mulate theorems of great generality. We shall instead consider only 
three concrete cases. 

We shall need the following simple Lemma which is really a simple 
case of Kronecker’s Theorem and belongs of right in Chapter XI. 

Lemma 8. Let a.,p,y, d be real numbers with ad— j3y=j=0. Suppose 
thata.jp is irrational. Then to every number e>0 there is anr] =r}{a,p,y, d, e) 
with the following property : 

For any numbers X, pi there are integers m, n such that 
\ma + np — X\<B, \my n 6 — fi\^rj . 

By Minkowski’s linear forms Theorem there are integers [m, n) =j= (0, 0) 
such that |wa+«/3| is arbitrarily small; and ma+np^O since ajp is 
irrational. Hence there are integers (wj , n^) and (iWj , W 2 ) such that 

0 < I Wja -f W]^| < £, 0 < I Waa -f «2/S| < £, 

and 

miW2=j= JWjWj. 

Put 

Xj = mja + njP, Yj = m^y + njd (/ = 1 , 2 ), 

so that 

I Xy| < £ (7=1,2), Xi Y2 =t= X2 Yl . 

Let g, (T be the solution of 

Q Xi Cl X 2 — X t Q F Vf Y 2 — pi , 
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and choose integers a, b such that 

I'* — 

Then 

|aZi + 6X,-A| =|(«-g)Zi+(i-a)Z2|^i(|Zi|+|Z2|)<£, 
and 

\aY^ + hY^~ f.i\ ^idyil +IT2I) =ri (say). 

The lemma now follows on putting 

m = am-y-\- hm^, w = a Wj + i «j . 

X.5.2. Perhaps the simplest isolation theorem is that for and 
is due to C. A. Rogers [unpublished, but see Cassels (1957a), Chapter II 
where an application to the “Markoff chain”, due to Rogers, is given]. 

Theorem VIII. Let x^x^ be integral and non-null on the 2-dimensional 
lattice A and let there be a, be A such that 

= — a< 0< 61^2 =/ 3 . ( 1 ) 

Then there are numbers rjQ>0, »ji>0 with the following properties: 

Let X be a linear transformation and suppose that 

IK-i||<»?i (2) 

and 

A 2 4 = 0 , ( 3 ) 

where the transformation X =tx is given by 

Xi “ I’ll Aj T " * 1^12 ^2 > -^2 “ ”^21 -^1 4 " 1^2 2 ^2 • 

Then there is a point c 4= o of x A such that 

-a(1 -»7 o)<CiC 2<^(1 -%). lci|<1. 

We may suppose without loss of generality that 

aj>0, f>i>0 

and so 

a2<0, 62 >0. 



By Theorem VII, there is an automorph X=tax of A of the shape 

JVi = COj Aj , JV2 = 0)2 X2 , 



where 



0<O)i<i<O)2, C0iC02=1. 

Then A contains all the points 

~ % > *^2 ^ 2 ) I ~ (^2 W > ^2 ^ 2 ) 

where m is any integer, positive negative or 0. 



( 4 ) 
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We must now distinguish two cases according co the sign of Tig. 
Suppose, first, that 

Ti2>0. (5) 

Let the integer m be determined by 



TiiajCOg ”’+ Ti2a20J2‘> 0^ TiifliCOg ” ^ + rna2C02+^, (6) 

as is possible, since Tnfli>0>Ti2a2- Then 

-01-2. (6') 

Tji fli 

Hence 

‘^”=0(rr2“), (7) 

where the constant implied by the 0 symbol may depend on «i , «2 > ^^d 
where we assume rji in (2) chosen so that, say, |th — l|<^. Put 

c=Ta,„, ( 8 ) 

where a„ is given by (4). Then, in the first place, it follows from (6) 

and (7) that 

^1~Tii^IiC 02 “f" T'i2^?2^2 — 

so 

I Cl I < t 

if tji is chosen small enough. Secondly, it follows from (6) or (6') that 
0 < w”cj g; Tiiai(1 — £0^^). (9) 

But now, by (7), 

(Oi”'C2 = T2iaiwr^”‘+ T22«2 = ^22 «2 + 2) ■ ( 10 ) 

Put = Then since a2<0<«i. we have from (7), (9) and (10), 
that 

^ 1 ^ 2(1 ^ 0 ) ^ CiCg^C 0, 

provided that T 12 , Tgi are small enough and that Th,T 22 are near 
enough to 1, which may be achieved by taking small enough in (2). 
This concludes the proof when Ti 2 > 0 . The proof when Ti 2<0 is 
completely similar, except that 6 is used instead of a. 

Corollary. Under the hypotheses of the theorem except (3), there is 
an r ]2 such that if 

and T is not an automorph of x-^x^, then tA contains a point c with 
flg (1 ^ 0 ) < ^1 Cg < Ji &2 (t *?o) ■ 
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For if T is not an automorph, then either Tj 2=(=0 or 
then the theorem apphes; and if T 2 i=# 0 then the theorem can be applied 
with the roles of and interchanged. 

Note that Theorem VIII works with the values of x^x^ at two 
distinct points of A. This rather restricts its field of apphcation. The 
other isolation theorems which we shall discuss require at most knowledge 
of the value of the function at only one lattice point. 

X.5.3. Before discussing the isolation results for 

<p(x) =%i:T 2 % 

we require a simple lemma. 

Lemma 9. Let x^x^x^ he integral and non-null on A. To every e>0 
there is an rj>0, depending on A, with the following property: 

To any numbers q> 0, a>0 and index ^ =1, 2 or 3 auto- 



morph X =u>x of A: 




Xj=o),Xj (t^/^3). 


(1) 


with 




CO/>0 (I^/^3). C0iC02O)3 = 1 


(2) 


and 




1 — £ < < 1 + £ , ~ < ri . 

0>2 ' 0 


(3) 



For by Theorem VII there are certainly automorphs 3 and of A 
defined by 

respectively, with 

0<#2<I, 0<^3<1, = 

0<y>3<i, 



Put 


^. = logi?,-, 9,=logv>,; 


(4) 


so 


Z*! + ^2 + ^3 = 9l + 9'2 + ?3 = 0 


(5) 


and 


^1>0, />2<0, ^3<0, 






9i<0, ?2>0, ?3<0. 




Hence 


~ Piil — p2iz~ 


(6) 



We now show that 

{Pi~ ‘I 2 ) (7) 



is irrational. If not, there would be an automorph X = with 

integers (m, 0 ) =|= (0, 0) for which, in an obvious notation, ^ 1 =^ 2 - But 

Cassels, Geometry of Numbers 19 
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then, Xx would be rational, since Ai, Aj, A 3 satisfy a cubic equation with 
integer coefficients by Lemma 6 , Corollary 1 . Hence Ai = A 2 = A 3 =l, by 
Lemma 6 , Corollary 2 ; that is 

upj + vqj = 0 

which contradicts ( 6 ). By ( 6 ) we may now apply Lemma 8 , with 
A=logp, o. = Pi-pi, /5=92-?i- 
^=logcr, y=pk, 

and 

min I log(l ± e)| , lpg»? 

or e, r) respectively. Then 

w = 9” ij>", 

where m and n are given by Lemma 8 , clearly has all the properties 
required. 

It is now a simple matter to prove 

Theorem IX. Let be integral and non-null on A and let £^>0 

be arbitrarily small. There exists an rji>0, depending on Cj and A, such 
that if 

ll'T - ill < % (8) 

and tT is not an automorph of x^x^x^ for any number t, then the lattice 
tA contains a point c =bo for which 

I ^1 £3 C3 1 < £j . (9) 

Let T be given by X, = 2 when X =tx. If t is not an auto- 

i 

morph, there is a Tjy=f =0 (» 4 =/)- We shall suppose that 

Ti 2 = max|T,- I > 0 , (9') 

* 4 = 1 

this being one of twelve possible cases^. Now A certainly does contain 
some point a with 

«i> 0 > a 2 - 

We shall pick one such point and keep it fixed in all that follows, so 
that numbers depending only on a and A will be said to depend only 
on A , etc. 

By Lemma 9 with an £>0 to be chosen later and 

p = — ^Tn>o, ff = l, ^=3, ( 10 ) 

aj Ti2 

* For the maximum in (9') may correspond to any one of the six pairs (i, j) 
with i#:/; and the maximal r,-,- may be either positive or negative. 
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there is an automorph to of A with 



where 



1 -- £ < 



^2 ^2 



< t + £, 



7] ^ < CO3 < ?y , 



( 11 ) 



rj —rj{A, e) ( 12 ) 

is independent of the r,y . Since is assumed near to 1 , say | — 1 1 < f , 

it follows from (11) and 0)10)20)3 = 1 that 

T;2<C0i<??'t|2. ('•3) 

where 



is independent of the r,-,-. 
We put 



rj' = rj'{e, A) 

c = Tto a€T A. 



(14) 



Then by (9'), (11) and (I3), we have 

( 1^1 riiO)i -|- ^2 Ti20)2 “T <*3 Ti3 0)3 I 

^0)r^{laiTii0)i + fl2Ti2W2l + |«3T^13ft>3|}< ^ 1 ^ 2 ’ 

where 

;«i = >ti(A), Ii = £i(A,£). 

It is important that >ti is independent of e. Hence 

< 2 xi£, ( 15 ) 



provided that T12 is smaller than a number depending on e. Similarly, 
but more simply, by (9'), (11) and (13), 

U>2^\c2\ <0)^^a)i|T2i«i| + |t22«2| + 0>3 | Tj 3 «3 [ 



where 



< I 1^22^2! +^2'^'f2> 



^2 — ^2 f 

and so 



0)2 ^Ical < 2fl2, (16) 

provided that T12 is small enough and T22 is near enough to 1. Similarly 

0)r^lc3| < 2lti3| (17) 

if T33 — 1 and rjg are small enough. From (15), (16) and (17) we have 

I ^2 ^3 1 ^ I ^1 ^2 ^3 1 ^ * 

Since e is arbitrarily small, we may put ei = 8| Xi«2^3| e. where £3 is the 
number in the enunciation. 

This completes the proof. 



19 ' 
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Note that we have used the full force neither of Lemma 9 nor of 
the inequalities (I})- 

The proofs of the following two corollaries may be left to the reader. 
Corollary 1 . Theorem IX remains valid if |ciCjC 3 |<£i is replaced 
by 0 <|ciC 2 C 3 |<£i. 

Corollary 2 . To every g2>0 there is an rj2>0 depending only on 
A, £2 Ibat, if 

l|T- l||<»?2 

and one of T 12 , T 13 , Tjj, T 23 is not 0, then there is a cexA with 
0 <C ] £1 ^2 £3 [ <C £2 I 1 j ^ 1 t i ^2 ! ^ f • 

Corollary 1 is proved in Cassels and Swinnerton-Dyer (1955 a). 
A somewhat weaker form of Corollary 2 is in Davenport and Rogers 
(1950 a). 

X.S.4. We now discuss 

q>{x) =xi{x\ + xl). 

As in § 4.4 it is convenient to introduce the appropriate complex co- 
ordinates 

■^1 = ^ 1 . •22 = ^2 + »* 3 . h = — ( t '2 = — 1 ). 

A transformation Z=xz corresponds to a real transformation for the 
real variables x if and only if it is of the shape 

= (1) 

k 

where 

■*^12=^13, T21=T3i, I'll = Til '’^23 = T32. '^22 ~ '^33 (2) 

and the bar (“) denotes the complex conjugate. 

Theorem X. Let 

f(x)=xi(xl + xl) ( 3 ) 

be proportional to integral and non-null on A and let 

A = | 9 p|(A) = inf | 9 p(a)|. (4) 

a 

+ o 

Then there are numbers rji>0, ?? 2>0 with the following properties: 
Suppose that t is a homogeneous transformation in the appropriate 
complex co-ordinates given by (1) and (2) such that 

l|T- i||<»?i. 



(5) 




Isolation theorems 



293 



Then 

(i) If Ti2 = Ti3#=0, there is a c = (yi,y^,y^)=^ 0 , in complex co- 
ordinates, in tA such that 

\YiYiYs\<M'^-ni)> l7i|<'l- (6) 

(ii) If T3i = T 2 i=j= 0 , there is a c = (Yi>Y2>Y3)^^ 

\YiY 2Y3\ < - m) • |yai = lyal <■• • (7) 

By Theorem VII there is an automorph Z=u)« in complex co- 
ordinates of the shape 

Zj=Q)jZj, ft>i(U2(U3 = I, tOi>l, (X)3=C02- 

Define numbers T >0 and y by 

w2=T-U{x). C02=T-U{-x). ( 8 ) 

where 

e(x)=e^’-'T 

If X were rational, say x=Mjv, the transformation to'’ would have two 
equal eigenvalues col, <^ 3 . which would thus be rational and so 1, 
contrary to hypothesis (cf. proof of Lemma 9)- Hence x is irrational. 
Thus by Lemma 8 with £ = ^, there is a number %>0 with the follow- 
ing property: To every pair of numbers p>0 and f there are integers 
M and V such that 

\ux-\-v -^p\<\ (9) 

and 

(iO) 

We now prove (i). Since cp{x) is proportional to integral on A, 
there is an ae A of the shape 

o = (ai.«2.«3). a2=C«{^), a.3=Ce{—&), (11) 

where 

ai>0, C>0, 

and A is defined by (4). Put 

Ti2=-ct£(v<), r^3= -cre(-f), (12) 

where a>0. Then a is small when ||t — i|| is small. We now choose 
integers « and v to satisfy 

\»X + v-(y> + '&)\<\ (13) 



2%ctC ’ 



[cf. (9)] and (10) with 



Q 



(14) 
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SO that 



V 3 ^< 



2g;r3« 

Til ®1 



(15) 



Since Th is near 1, there are two constants rj', rj", depending only on A 
(and o), such that 

0 < rj' <T“ < r]"a~K (16) 



We shall show that the point 



c=TO> “a = (y1.y2.73) 



(17) 



satisfies the conditions of Theorem X in case (i). In the first place, 

=i«i^ii- T^'‘aC{e{&+y)-ux)+e(-‘&-y) + ux]}\ | 

= |aiTii — 2 cos 2::r(«? + ^ — m;^)| > ( 18) 

^ ai Til (1 - i < «i (1 - i 7 ] 3 ^) j 



by (11), (13), (15), provided that ||t — i|| is small enough. Further, 
r-“|y3| = 7'-“|y2l^|T2ilai7'-^“+C|T,,|+C|r,3|<C(1+c) (19) 

for any given e> 0, provided that ||t — 1||, and so also a, is small enough. 
From (18) and (19) we then have 

lyiyzysl < «ifMi - i V3 ^) (i + «)* < «iCMi - i = ^(i - i vz^) ■ 

if e was chosen suitably. Since (16) and (I8) clearly imply |yi|<l if 
1|t — i|l is small enough, this completes the proof of (i) of the theorem 
with rj3^. 

The proof of the second part is similar on considering xta'*a with 
suitable positive integer u. The details may be left to the reader. 

For a later application we note the 

Corollary 1. The numbers rj^ and rj2 may be chosen so that the con- 
clusion of the theorem holds uniformly for all lattices A=XM, where M 
is some fixed lattice on which cp{x) is proportional to integral and non- 
null and X runs through all automorphs of <p(x). 

It is clearly enough to consider the case when Z =X« is of the type 
Z~XjZj. Then u is an automorph of A if it is of M. Hence the only 
non-uniformity is possibly introduced by the point a. But clearly there 
is a number R depending only on <0, and so only on M, such that 
|co*a|<i? for some k. If t«)*o is taken for o, there is then complete 
uniformity in the estimates. 

Corollary 2. When t is any automorph of cp (x) with 

0 = Ti2 = Tj 3 = T21 = T31, 
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then 

jnf 1 95(xa)| g I Till (I ^22! - I ^23!}"^ • 

+ 0 

We may suppose that A—\ and that a =^ = (1,1,1). For any 
integer u positive or negative we have 

|95(xco“e)| =\r^•^\\x^^e{ux) 

where % is given by (8). By Lemma 8, we may choose u so that 

1 '*^22 e{ux) + '^2ze{-'ux)\ 

is arbitrarily near to 1 1 Tgjl — I Xjjll, and the corollary follows. 

Note that 



= l Till 11^221^- |T231"1 ^|Tii|{|T22| “1^231}", 

with equality only when T22 = 0 or t 2 s = 0, i.e. when x is an automorph 
of f{x). 

X.6. Applications of isolation. Following Davenport and Rogers 
(1950a) we first use isolation to strengthen Theorem V. For x^{x\-\-x\) 
it gives the best result to date, but for x^X 2 X^ more is known, see Theo- 
rem X of Chapter II, which is not proved in this book. 

Theorem XI. A. There is an ??i>0 such that every lattice A admis- 
sible for 

J/[: I Xi Xj T3 1 < 1 

and with 

i(A)<7(1 +i?i) 

is of the shape 

A = f u) Nj, 

where w is an automorph of .TJ, and Ni is defined in Theorem V. 
B. There is an r] 2>0 such that every lattice A admissible for 

lxi(x| + xi)| < 1 

and with 

^f(A)<i-(23)i(1+»?2) 

is of the shape 

A = xo) N2, 

where N2 is defined in Theorem V B, ta is an automorph of JT 2 and x 
is a transformation = Tjj^Xi, with Ti 2 = T]3=T2 i = T 3 i = 0. 

k 

We first prove A by reductio ad absurdum. Suppose, if possible, 
that rjx does not exist. Then there exists an infinite sequence of admis- 
sible lattices M,, (1^r<oo), none of the shape fciuNi, and such that 
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Now 

inf 

o€Mr 

by Theorem V, and since M, is admissible ; and so there is a sequence 

of points 

«, = («lr.«2r.«3r)eM, 

such that 

l«lr«2f«3r|^f (r Oo) . 

On replacing M, by a>, M,, with a suitable automorph co, of we may 
suppose that 

a,= (s,,s,,s,), s,-^l 

By Mahler’s compactness principle, there is a convergent subsequence 
of the M, which we may also call M,, say 

(1) 

Then (M) = 7 and M is ,T^-admissible, so is critical. Further, (i , 1 , i ) 6 M 
and so, by Theorem V, we have 

M =SNi 

where S is an automorph oi In particular, is integral on M. 

But now 

M, = t,M 

for transformations t, such that 

K-MKO (y^oo). 

Since M, is ,T^-admissible, the transformation t, must be of the shape 
for some number and some automorph of by Theo- 
rem IX, provided r is sufficiently large. This contradicts the definition 
of the M,, . The contradiction proves Theorem XI A. 

The proof of Theorem XI B is similar but using Theorem X instead 
of Theorem IX. The details may be left to the reader. The only point 
to notice is that if t and o> are as enunciated in the theorem, then 
TU) =u>'t' for some u>', t' with similar properties to to and t respectively. 

Corollary to Theorem XL B. To every e>0 there is anrj—ris (e) > 0 
such that every admissible lattice A for 

rf(A)<i(23)i(l+%) (2) 

is of the shape A=Tt«jN 2 , where t, to are as in the theorem and 



||T-l||<£. 
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We take %<»?2 fo*" the of the theorem, so that A=to)N 2. We 
may suppose that Tu> 0 and then, incorporating an appropriate auto- 
morph in to, that 

= t- (3) 

Then 

t +%> 1-^231*} (4) 



where we use the appropriate complex co-ordinates for t as in § 5-4. 
But now 



{I '^ 22 ! |'*^23|}^=t 



(5) 



by Theorem X, Corollary 2 since A is ,T^-admissible ; and so, in particular. 



{haai - ItjsIP 



< f +%■ 



Hence if % is small, either IT22I/IT23I or IT23I/IT22I is small; and we 
may suppose the latter on incorporating in 01, if necessary, the trans- 
formation which interchanges ^2 and x^. We may further incorporate 
in ti> a transformation of the type 



x^^e{x)x^, x^^e{-x)x^. 



where e{2)=e^’"^ and x is chosen to make t2 2 real and positive. Then 
from (4) and (5) we see that T22— f and T32 are small if rj^ is small. 
Since T33=T22 and T23=T32, this proves the corollary by (3), and since 
the remaining terms are 0. 

X.6.2. The following interesting result about x^x^x^ has no analogue 
for Xi(x| -f X3), since it depends on the fact that e in Theorem IX may 
be chosen arbitrarily. There is, however, a corresponding result for 
^1 + ^2 — -*3. see Cassels and Swinnerton-Dyer (1955a). 

Theorem XII. Suppose that for some number D there are infinitely 
many lattices M, (l^»'<oo), admissible for 

■ \xi X2 Xj^ <c i , 

with d(Mf)'^D; and such that no two, M', M", say, are of the shape 
M" = fu>M', where t is a number and u> an automorph of Then 
there is a lattice A admissible for with d(A)^D on which x^x^x^ is not 
proportional to integral. 

For the lattices M, have a convergent subsequence, say, without 
loss of generality 

M, -> A (r -> 00) . 



If X1X2X3 were proportional to integral on A, then by Theorem IX and 
since M, is .T^-admissible, we should have for all sufficiently large r 

M, = f,a>,A 
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for some numbers t, and some automorphs u), of This clearly 
contradicts the hypotheses of the theorem. 

As stated in § i , it is unknown whether such a JD or such a A exists. 

X.7. An infinity of solutions. We now prove some results of Da- 
venport and Rogers (1950a) about the existence of infinitely many 
points of a lattice in certain point-sets with groups of automorphisms. 
They prove more than we do here ; the reader is referred to their inter- 
esting memoire for the details. 

The following trivial lemma gives almost all we need for the first 
type of result. 

Lemma 10. Let S2 he some group of homogeneous linear transforma- 
tions <j>. Suppose that for every a; =J= o and every number r there is an toeS2 
such that 

|toac| > r. 

Then for every pair of numbers c, C with 

0 < c < C < oo (1) 

and every number r there is a finite set of elements u)i , . . . , ti>„ of £2 such 
that 

max > r (2) 

for all X in 

c^|a;|^C. (3) 

This is a simple application of the Heine-Borel covering theorem. 
The infinitely many open sets -^(w) of points x such that |wa:|>r 
cover the compact set (3). Hence a finite covering may be selected from 
the 

Theorem XIII. Let the houndedly reducible^ star-body have a group 
S2 of automorphisms lo such that to every a:=)=o and every r there is an 
o>e£2 such that |fa>ac|>r. Then to. every integer k>Q there is a bounded 
set 6 ^ 1 , contained in such that every lattice A with d{/\)<A ( 6 ^) has at 
least k points in other than o. 

That 6^1 exists is equivalent to the statement that is boundedly 
reducible. We suppose ^ has been found and deduce the existence of 
^k+i- We may suppose without loss of generality that is the set 
of points of 6 ^ in some sphere 

|a:| ^ C = C*. 

Further, there is a positive number c*<C such that the entire sphere 

|a:l^(A-l-1)Ci (4) 



* For definition, see Chapter V, § 7.2. 
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is contained in y. We denote by the set of points of ^ in 

|a?| ^ Q 11 , (5) 

where 

C*+i>max {Q,(A + 1)c*} 

is so large that (5) contains all the sets where thetOy 

are given by Lemma 10 with c=Ci, and r = C=Cf.. We must verify 
that has the required properties. 

By hypothesis, if d[/\)<A(S^) there are k points of A in ,5^ other 
than o. If one of them, say a, is in | x | < c* , then all the points 

la (1^/^A + 1) 

are in |x|^Cj_,_i and in 6^, so in as required. Otherwise, there 

is a point b of A in for which 

c = Cjg|5|gC = Q. 

Hence there is an automorph co, of the set tOi , . . . , to„ such that 
|u)yb|>C. Hence b€<o,“^5^. But now, since u)y is an automorph, 
we have 

I det tjy I = 1 , 

and so 

d(to^A)^d(A)<A{^). 

Hence by the defining property of there are k points of a>, A in 5^ , 
that is there are k points of A in a)“^ These together with b give 
A + 1 points of A in ^k+i> required. 

Corollary. When is fully reducible^, the conclusions of Theo- 
rem XIII continue to hold when d{A) =A(S^), provided that A is not a 
critical lattice of 9’. 

For the existence of ,5^ is equivalent to the statement that ^ is 
fuUy reducible, and the induction now goes as before. 

When the star-body is not boundedly reducible only slightly less 
than Theorem XIII is true. 

Theorem XIV. Let 6^ be a star-body and A^ any number in 
0<A^<A{^). 

Then to every integer k there is a hounded star-body (depending also 
on A^) such that every lattice with rf(A)^Ji has at least k points other 
than o in . 

We may suppose that ^ is open. Suppose, if possible, that for 
every integer r there is a lattice A, with d{Af)^A-^ which contains no 

' For definition, see Chapter V, § 7.2. 
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point other than o of if in Then Mahler’s compactness 

theorem apphes, and there is a lattice A' which is the limit of a conver- 
gent subsequence of A, . Since i (A') ^ and A' is ^-admissible, this 
contradicts the definition of A{if). The contradiction shows that 
exists. The induction from to then goes exactly as for Theo- 
rem XIII. 

X.7.2. Where they apply, isolation theorems may give stronger 
results than those § 7-1, as the following example shows. 

Theorem XV. Put 

?>(®) =^ i («2 + 4 ). ( 1 ) 

There is a number rjo >0 such that every lattice A has one of the following 
two properties. 

(i) there is a number t such that the set of x.^-co-ordinates of tA is 
identical with the set of x^-co-ordinates of the critical lattice Ng o/ 1 95 (a5)| < 1 
occurring in the enunciation of Theorem V B, or (ii) for every e>0 there 
is a point a =|= o 0/ A such that 



\(p{a)\<-^H -rj(,)d(A), lai|<e. (2) 

(23)* 

We will choose rjo later in the course of the proof. Suppose that (ii) 
is false for some particular A and s. For integers r = 1, 2, . . . , let A,, be 
the set of points (xj, Xj, Xa)€A. Then there is a 

convergent subsequence 

M* = A,^^M, (3) 

and M is admissible for 

\xi(x\-\- x\)\ <_^(1 -riQ)d{A). 

(23)i 

Hence by Theorem XI B, Corollary for any given Sq we may choose 
Tjo=r]o{so) so small that 

M=fTfc)Na, ||t — i|l<£o, (4) 



where t , 10 are as in Theorem XI B and t is some number. We take 
for £q the number j/j which occurs in the enunciation of Theorem X 
and its Corollary when M =Na. By (3) and (4) we now have 



for some o* such that 



f o* (») Aa 

l|o*- i||<£o. 



(5) 



for all sufficiently large k. Clearly M, does not contain any points 

C = (yi.y2,y3) with |yi|<l and |yiy2y3i<— ^ if ^ is suf- 

(23) 
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ficiently large. Hence, by Theorem XI and its Corollary, if is small 
enough, there is z.a=a^ such that CTjj = ( 7^3 = 0 in the obvious notation; 
indeed this happens for all sufficiently large k. But then, by (5) this 
implies that (i) holds. This concludes the proof of the Theorem. 

There is a similar result where | aj | < e in (2) is replaced by e, 

cf. Davenport and Rogers (1950a). 

X.8. Local methods. For many questions concerning indefinite 
quadratic forms the appropriate tool is the theory of continued fractions. 
We only mention the topic briefly here since the application to specific 
problems not infrequently involves detailed calculation. Continued 
fractions appear very naturally from the point of view of the geometry 
of numbers. We sketch the connection here and refer the reader to 
the author’s Cambridge Tract [Cassels (1957a)], where they are intro- 
duced in a similar spirit 1 in a slightly different context, for a fuller treat- 
ment and references. There a knowledge of the geometry of numbers 
could not be assumed. For another account of the relationship of 
continued fractions to quadratic forms see, for example, Dickson (1929a). 

Characteristic applications of local methods are Markoff’s original 
treatment of his chain theorem (Markoff l879a), [there is an account 
in Dickson (1930a); compare Chapter II, §4)]. the paper of Blaney 
(1957a) that will be discussed in Chapter XI, §4, and the paper of 
Barnes (1951a). But apphcations are almost everywhere dense in the 
hterature. 

Let us suppose for convenience that the 2-dimensionaI lattice A has 
no point except o on either a.xis. Then no two distinct points of A have 
the same Tfj-co-ordinate or the same Xj-co-ordinate. There certainly 
exist points Xo=(%o. ^20) of ^ such that o is the only point of A in 

|%|^|*lo|' |^2 l^|^ 2 o|- 

Let ±aJi= ± (Xji, «2i) 4=0 be the points in |«i|<|^io| for which 
is least. Then there is no point except o in 

l^l| l^iol ' 1 ^ 2 ! |^2l| > (I ) 

and a fortiori in 

l-^ll<|^ll|> |'*2| < |-^2ll • 

We may then repeat the process with instead of to obtain a se- 
quence of points aCi , ®2 , ... . Similarly we may start with Xq and inter- 
change the roles of and to obtain a sequence of points ac_j, x_2 

There is thus a sequence of 

= (Xjy , Xij) (- 00 < / < 00) 

* Which goes back to Felix Klein (l89Sa and 1896a). 
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such that there is no point of A except o in 

I % 1 < I % ; I > 1 ^2 1 < I ^2, > + 1 1 • ( 1 ) 

Clearly a necessary and sufficient condition that a point ygA should 
occur as d:®; for some / is that there should be no point of A except o 
in I Xj I < I yj I , I ^2 1 < 1 y2 1 • Hence the sequence of pairs d;®; is completely 
determined by A, although the particular pair chosen to be di®o is, 
of course, arbitrary. If u) is any automorph of then the sequence 
of pairs for to A is either if to does not interchange the axes of 

co-ordinates, or (i.e. in the reverse order) if it does. 

Since there is no point of A in (1) except o, there is no point of A 
in the closed triangle with vertices o, except the vertices; 

and so Xj, Xy+j is a basis of A for each /, by Lemma 6 of Chapter III. 
We must now introduce an asymmetry between the x^- and x^-a^y-es to 
study the relationship between the various bases Xy.Xy^i. We choose 
Xy to be that point of the pair ^Xj for which 

X2y > 0 (all j) . (2) 

Then 

^1; ^1,7+1 (3) 

since otherwise Xy,_i— Xy would lie in (1). Since both Xy„i,x, and 
Xy,Xyy^i are bases, we must have 



*y+ldia?y_i = ayXy 

for some integer ay . Since 



we must have 



•*^2,; + l > ■*2,7 



> ^2, 7-l> 



«y > 0 . 



Then we must have the — sign in (4), since 



(4) 



I ^1,7+1 1 < |%7'| < 1^1, 7-1 1 • 

and (3) holds for every j. Hence there is a sequence of integers ay>0 
such that 

*7 + 1 *7-1 = ^7 *7 • 

It may be shown that if two lattices have the same sequence of integers 
«y then they are identical up to a transformation of the type 



ft>2 X 2 ■ 

Further, to every sequence of positive integers «y there is a lattice. 

Hence it is natural in 2-dimensional lattice problems about x^X2 to 
consider not the lattice A itself simply, but the sequence a^. It turns 
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out that the behaviour of any particular basis, say Xj,Xj^i, of A is 
influenced very strongly by the value of Uj for / near to / but only 
very weakly by for j remote from /. In many problems it is possible 
to study the behaviour of only a few Uj at a time. Hence the name 
“local methods”. 

It would be interesting if local methods could be successfully extended 
to problems in more than 2 dimensions, for example to problems relating 
to x-^ma.x{x\, x\), Xt^{x\-\-x^, x\-\- xl — x\ or x^x^x^. The difficulty is 
not to find the analogues of the x^ but to devise techniques to cope 
with their interrelations. Continued fractions have however been 
generalized to 2-dimensional lattices over a complex quadratic field, 
i.e. substantially to certain special 4-dimensional lattices, see Poitou 
(1953 a) and the references there given. 



Chapter XI 

Inhomogeneous problems 

XI. 1 . Introduction. As previously, we say that points Xy and x.^ are 
congruent modulo A, written 

Xy = *2 (A) , 

where A is a lattice, to mean that Xy—x^^/\. The set of points x con- 
gruent to a given point Xg modulo A is called a grid^ @ : A will be called 
the lattice of the grid and we shall call 

d(&)=d(A) 

the determinant of the grid. The characteristic inhomogeneous problem 
of the geometry of numbers is to find conditions under which a grid 
has a point in a given set 

There is a wide variety of different problems. Thus one may be 
concerned with all grids of given determinant d(&) or one may have 
information about the lattice A. Many of the fundamental techniques 
for inhomogeneous problems are natural extension of those for lattices 
[compactness theorems and so on; for bodies with automorphs see 
Swinnerton-Dyer (1954a)]. For some specialized problems some 
extremely powerful and delicate techniques have been developed which 
would take too much space to discuss properly. Hence this last chapter 
will have more the character of a report and less that of a detailed 
exposition. 

^ Other terms are inhomogeneous lattice or non-homogeneous lattice. 
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XI. 1.2. The following simple result due to Macbeath (1951a) helps 
to fix ideas. 

Theorem I. Let the set have finite volume V{£^) and let e>0 be 
given arbitrarily small. Then there are grids @ with <f(@) =e having no 
point in LT’. 

We may choose R so large that the portion of L/’ in has 

volume <j£. Let A be the lattice with basis 



6i = {4/?, 0, 0,...,0) 
b^^iO.rj, 0 , 0 , ..., 0 ) 
bs = ( 0 , 0,rj,0,...,0) 

b„ = (0, 0,0,..., 0,7?), 

where 

ARrf''^^ = e. 



( 1 ) 



( 2 ) 



Every point of space is congruent modulo A to precisely one point 
of the parallelepiped 

The volume of 0* is V[0) =d{f\)=e, by (2). If a point x'=^y'jbj 
of 0 is congruent modulo A to a point x^ in | | ^ R, then clearly 

Hence the set of points of 0 with this property has measure 
at most ^e. But now the set of points x^ of 0 with | ajj] >i? has volume 
at most je by construction; and hence so has the set of points x" of 
0 which are congruent to at least such one point (compare the proof 
of Theorem I of Chapter III). Thus the set of points of 0 congruent 
to a point of 0 has measure at most Je + je<e = F(^). There is 
thus a point Xq€0 which is not congruent to any point of 0. The 
grid @ of all points congruent to a;,, modulo A clearly has all the proper- 
ties required. 

XI. 1.3. We shall mainly be concerned with star-bodies 0 defined 
by a distance-function, 

0: F{x)<\. (1) 



For any lattice A and any point x^ we write^ 

m[Xo) — m{xQ, A) = inf F(x), (2) 

X5Xq(A) 

1 So m{x^) =E(Eo) in tlie notation of Chapter VII § 2.2, where is the element 
of the quotient space to which belongs. 
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and 

/t(A) =supw(a;o. A). (3) 

Clearly 

fi{tA) = \t\/i(A) (4) 

for any 1 4= 0. 

The infimum in (2) need not be attained, though it clearly is attained 
when the set F(a;)<1 is bounded. The function m{x„) need not be 
continuous, but it is semi-continuous; 

lim sup (») ^ w (Xq) . (5) 

Indeed given any e>0 there is a point aeA such that 
F{Xo + a) <m{XQ) -h s, 

and then 

F{x + a) <m{Xg) -|- e 

for aU X in a neighbourhood of Xq, by the continuity of F(x); so 
w(x)<w(Xo) +£ in this neighbourhood. Again, when F(x)<l is 
bounded, the function m (x) is readily seen to be continuous. The reader 
will be able to supply the proofs of the positive statements just made 
on the lines of the proof of the semi-continuity of the function F(A) 
in Chapter V, § 3-3- Examples to show that the infimum in (2) need 
not be attained and that m{x) need not be continuous are provided in 
2 dimensions for certain lattices A when F(x) This case has 

imphcations in the theory of algebraic numbers and has been extensively 
investigated both because of this and because of its intrinsic interest; 
see Barnes and Swinnerton-Dyer (1952a, b and 1954a) and Barnes 
(1954a), where there are extensive references to earlier work. There is 
some work on similar lines for (n=}), but it has not been 

carried so far, see Davenport (1947c), Clarke (1951a) and Samet 
(1954a, b). 

From the definition (2) it follows that m{x) may be regarded as 
defined on the quotient space ^/A (compare Chapter VII). Since this 
is compact, it follows from (5) that the supremum in (3) is always 
attained; that is, there is an Xj such that 

^ (A) = m (Xi , A) . 

Of course the infimum in (2) need not then be attained for Xi = x„. 
With unbounded sets F(x)<1 there may be again a phenomenon of 
successive minima; that is, it may happen that 

sup m (Xo) 
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Indeed some rather elaborate patterns of successive minima have been 
found, see the papers of Barnes and Swinnerton-Dyer just quoted. 
The quotient 



d{A) 



( 6 ) 



is unchanged on replacing A by tA, by (4). We shall write 



where possibly b (F) = 0. 
now show that 



b(F) = infi^^i^, 

^ ’ A d { A ) ' 
If the set F(a;) < 1 



( 7 ) 



has finite volume Vp, we 



h(F)^Vf\ 



( 8 ) 



Let A be some lattice and e>0 be arbitrarily small. There is a point 
acj congruent to any given point and satisfying 

F ( Xj ) < fi { A ) + s . (9) 



Hence the set (9) must have volume at least d(A). Since the volume 
of the set of points acj satisfying (9) is 

{/W(A) + fi}" hjr , 

the required result (8) follows. 

We shall show in § 3 that if the body F(a:)<1 is bounded, the 
infimum in (7) is attained; that is there is a lattice M such that 

{^.(M)}'' = b(F)i(M). 



We shall treat the estimation of b(F) for convex distance-functions 
F in § 2 where the relevant literature will also be discussed. 

When 1^=00 it is, of course, still possible that b(F)>0. In par- 
ticular, Davenport (1951 a) showed this to be the case for the 2-dimen- 
sional distance-function 





F{x) 


= 




His estimate. 










b(F) 




1 

128 » 


was improved to 










b(F) 




1 

45-2 



( 10 ) 



by the author [Cassels (1952a)], with a probably simpler proof. This 
has recently been improved by Ennola (1958a) to 

b(F)^(l6 + 65)-i= 3j4-, 

by a modification of Davenport’s original method. On the other hand. 
Miss Pitman (1958a) has shown that 
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More recently^, she has obtained an even smaller upper bound for 

b(F). 

The problem of determining b (F) for F given by (fO) is closely related 
to the problem of determining the real quadratic numberfields with a 
Euclidean algorithm. Davenport extended his work to number-fields 
of two other types corresponding to 

F® = {x\ + x\) and F*=={xl + x|) {xl + xl) . 

These results were proved by the author [Cassels (1952a)] much more 
simply and with a better estimate of b(F). 

Hlawka (1954c) has generalized these results to any distance- 
function F{x) in n variables which may be put in the shape 

{F(ac)}" = (F, , . . . , x,)Y {F„_, . . . . , 

where i;,F„_, are r- and (w — r) -dimensional distance-functions such 
that the star-bodies (ac) < 1 and i^_,(a;)<1 are bounded. We do not 
prove these results here. A closely related problem is treated in the 
author’s tract [Cassels ( 1957 a) Chapter V, § 6], where there are further 
references. 

In general it appears to be a difficult problem to decide whether 
b (F) =0. Thus it does not appear to be known whether this happens for^ 

F{x) = \x\-\- x\ — w = 3 

or 

F(a!) = 1^1 X2X3 1 1 « = 3 . 

XI. 1 . 4 . It follows at once from Macbeath’s Theorem I that 



®(F) =sup 



d(A) 



is 00 whenever F)r<oo. In § 4 we shall be concerned with ®(F) for 

F = |xi . . . 

It was conjectured by Minkowski that ®(F) =2“", but this has been 
proved only for « = 2 . 3 , 4 . We shall give references and a further 
discussion in § 4 . We shall also give a result of Chalk about the set 

x^x.^... x„-^\ Xj>0 

(not a star-body!) and quote other work about sets defined in term 

01 Xi...x„. 

’ I am grateful to Miss Pitman for allowing me to refer to this unpublished work, 
now published. Acta Arithmetica 6 (i960), 37—46. 

^ The first case has been settled by E, S. Barnes [J. Austral. Math. Soc. 

2 (1961/62) 9 — 10], who shows that b(P) =0. 
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The value of ® (F) for 

F{x) = \xl + xl\i (m = 3) 

has been found by Davenport (1948 a) who showed it to be isolated 
and the investigation of the successive minima was carried further by 
Barnes (1956a). More recently Birch (1958a) has found 5)(F) for 

F{x) =\xl^ h 4 - 4+1 4r\^ {n = 2r) , 

for all 2. Estimates for 



F{X) =\xl-\ h - 4+1 4„\i 



with r>0, « — r>0 have been given by Blaney (1948a) and improved 
by Rogers (1952a) and Miss Foster (1956a). All the work just 
described is of course equivalent to finding the best possible constant 
rj, ^ such that 



sup 

Uo real 



inf I / (u 

14 integral 






for all indefinite quadratic forms / of signature (r, 5) with r -\-s=n and 
with determinant D. We shall not discuss this work further in this 
book but refer the reader to the original memoires. 

XI. 1.5. For some functions F{x) there are inequalities, valid for 
all A, connecting 

/i(A)=sup inf F[x) 

Xsa»j(A) 

and 

jF(A) = inf F(ac) 

=t=o 

or, more generally connecting fjt[/\) and the successive minima of F{x) 
with respect to A. When F{x) is convex, there are further relations 
with the corresponding quantities for the polar distance-function F*(x) 
and the polar lattice A*. These relations go under the general name of 
transference theorems^ (Ubertragungssatze) . Thus Dirichlet’s hexagon 
Theorem VII of Chapter IX may be regarded as a very precise trans- 
ference theorem for We shall discuss transference theorems 

for convex functions F[x) in § 3- Much interesting work has been done 
on transference theorems for the non-con vex F{x) defined by 



(»)}": 



lliJ< i^^r+k+4+s + k). 



where «=>■-)- 2 s, but here we can only refer the reader to the paper 
of Davenport and Swinnerton-Dyer (1955a), where references are 
given to earlier work. There is a striking related result in Swinnerton- 
Dyer (1954a). 



1 Presumably because information is transferred from one problem to another. 
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There is a further type of result which may most appropriately be 
mentioned here since they are transference theorems of a sort. Barnes 
(1950a) showed that if 

and if A has the basis Oj , then 

2/^(A) g max{F(ai),F(c2), minA(ai i a^)}. 

Other results of this general kind are known, see Bambah and K. Rogers 
(1955 a) and the references given there. In particular. K. Rogers (1953 a) 
showed that Barnes’ result is true for all distance-functions F(x) such 
that F(x)<i has the same general appearance as < 1- The proofs 
are all elementary and tend to involve a tedious splitting of cases. We 
do not discuss them further in this book. 

XI. 2. Convex sets. In 2 dimensions the problem of finding b(F) in 
the notation of (7) of §1.3 for convex functions F is completely solved 
by the following result [Bambah and Rogers (1952a)]. 

Theorem II. Let ^ he a closed 2-dimensional convex set and some 
number. A necessary and sufficient condition that there exist a lattice A 
with d (A) = 2I1 such that every point is congruent modulo A to a point 
of LA is that there exist a convex hexagon^ 3^ inscribed in ^ , which is 
symmetrical about some point and has an area V(3f) =Ai. 

Note that ^ is not required to be symmetrical about any point. 

Suppose, first, that 3^ exists. We may take the centre of 3^ as 
origin o. Let A be a critical lattice for 23^. Then d{A) = V{3^) =A^, 
by Lemma I3 of Chapter V. Hence by Theorems II, III of Chapter IX 
applied to 2 3if, and since 3^ is closed, every point is congruent modulo A 
to a point of 3^, and so of 

Suppose now that there exists a A such that every point is congruent 
modulo A to some point of y. If is unbounded, there is clearly 
nothing to prove, so we may suppose without loss of generality that 
is bounded. We shall construct the hexagon 3^ in stages. Suppose, first, 
that there is an a =(= o € A such that and a have inner points in com- 
mon. By taking 2*a with suitable integer 0 instead of o, we may sup- 
pose without loss of generality that 6A 2a and ^ have no inner points 
in common. Then there exist points c and d on the boundary both of SA 
and -^a such that the portion of the boundary of 6A between c 
and d (taken in an anti-clockwise direction, say) lies in y’-fa and the 
portion of the boundary of y-l-a between d and c lies in Then 
c — a and d — a are common to the boundaries of 6A and £A—a. Let 

* a parallelogram being allowed as a degenerate hexagon. 
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^ be the portion of ^ lying between the line joining c and d and the 
line joining c — a and d — a, and taken closed ; i.e. including the points 
of y on those lines. Then clearly is convex and every point of the 
plane is congruent modulo A to a point of After a finite number 
of steps (since ^ is bounded) we obtain a closed convex set S' 
such that every point is congruent modulo A to a point of S' but no 
two sets S' and S' a, aeA have inner points in common. Then every 
boundary point of S' is also a boundary point of .^ + a for some a=)=o 
in A. Since S' and S' a are convex, this common boundary is either 
a point or a line-segment. Since S' is bounded, only a finite number 
of a come into consideration, and so is a convex polygon. We must 
now show that it is symmetric about some point. Let the vertices of 
S' he Cl , . . . , c„, where the hne segment is the common boundary 

of S' and S'-\-Uj, A. Then the line-segment — Oy) is the 

common boundary of S' and S'—a^. Hence m is even, m—2l, and 

®)±( = — ®;. 

®)+; = ®/+i ~ +;+i = • 

Hence 

+ c,+i) = \ (^y+i + *^y+i+i) 

for each /, so e =-j(Cy-fCyy.;) is independent of j. Clearly S' is sym- 
metric about e. 

We may suppose without loss of generality that e=o. Then A 
gives a lattice packing of S' (or, more precisely, of the interior of S') 
and every point is congruent to some point of S' modulo A. Hence S' 
is a hexagon by Theorems II and VI of Chapter IX. This concludes the 
proof of Theorem II. 

Using known results about hexagons inscribed in convex sets, Bambah 
and Rogers (1952a) deduce in our notation (§ I. 3 ) that 

l^U^b(F)^| 

for a convex 2 -dimensional distance-function F inequality and the 
stronger inequality 

l^F,b(F)^^ 

f F is symmetric. The equalities on the right-hand side are attained 
when F(ac)<l is a triangle and a circle respectively. The left-hand 
inequality, which is valid whether F is convex or not, was obtained 
in § 1 . 3 . 

There is a theory of lattice coverings and non-lattice coverings which 
is closely analogous to the theory of packings discussed in Chapter IX. 
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For details in 2 dimensions see Fejes Toth (1950a and 1953 a) and 
Bambah and Rogers (1952a). 

Not much is known about b(F) in more than 2 dimensions. When 
F(x)<\ is the unit 3-dimensional sphere, the precise value has been 
found by Bambah (1954b), and other proofs have been given by Barnes 
(1956b) and Few (1956a); but all proofs are fairly complicated. The 
4-dimensional sphere has been considered by Bambah (1954a), who 
obtains an estimate for b(F) and gives a conjecture for the correct 
value. Estimates for b(f) above and below and also for the corres- 
ponding number for non lattice coverings have been obtained for 
M-dimensional spheres, see Bambah and Davenport (1952a), Daven- 
port (1952b) and Watson (1956a) for the lattice case, and Erdos 
and Rogers (1953a) and Rogers (1957a) for the non-lattice case, the 
last treating general convex sets. Very recently Rogers (1959a) has 
obtained much stronger results by more powerful methods. 



XI. 2.2.1 Rogers (1950b) has given an elegant proof of the following 
result relating b (f ) to the function 



d(F) =sup 



d(A) 



introduced in § 4 of Chapter IV. 



Theorem III. 

b(F)g2-"3’‘“‘<5(F) 



for all symmetric convex n-dimensional distance-junctions which vanish 
only at the origin. 

Rogers (1950b) also proved a similar result for non-lattice packings 
and coverings, and indeed with the smaller constant instead of 
2 -n^n-i Before proving Theorem III we note the following 

Corollary. 

Ffb(F)^3"-i, 

where Vp is the volume of F(x) <\. 

For 1^(5 (F)^ 2” by Minkowski’s convex body theorem. 

Rogers proves Theorem III by considering a critical lattice M for F, 
that is 

F(M) = 1, cf(M) ={6(F)}-i. (1) 

We use the notation of § 1. 3 ; in particular 

m(x„) = inf Fix). 

o-=x„(M) 



1 When n is at all large, the results of this section are superseded by Rogers 
( 1959a). 




312 



Inhomogeneous problems 



As was shown in §1.3, there is then a point such that 

m (aCj) = sup ni (x^) 

X, 

= /.(M) 

= fx (say). 

Then 



w(3®i) g//, 

and so, since P‘(aj)<1 is bounded, there is an aeM such that 
F(Jx^ — a) = w(3®i) ^/z. 

Then 

F{x^~\a)^\fi<ii, 

and so \a is not in M. 

Let A be the lattice of points 



( 2 ) 



( 3 ) 



5-fyO, 5gM, r = integer, 
so 

Hence 

(F(A)}''^6(F)i(A)=i<5(F)rf(M) 

by the definition of d[F) ; that is, there exists a point b + --a=)=o of A 
such that 

{f(b + |a)}’’^-id(i^)<f(M). (4) 

We may suppose without loss of generality that r = Ooril. Ifr=0, 
we have 6 =j= o, and so 

F(h) ^ F(M) = 1 , 

and (1) and (4) are in contradiction. Hence r=ibl, and 

F[b ± la) =F{b±x^T (aJi-^a)} 

^F(b±x^) -F{x,-\a) 

- sA 

= !/«. 

by (2) and (3). 

On substituting (5) in (4) we obtain 

^^2-"3"-M(F). (6) 

Since the left-hand side of (6) is at most b(f), by the definition of b(F) 
as an infimum (§1.3), the theorem follows. 
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XI. 3. Transference theorems for convex sets. In this section we 
consider for a symmetric convex M-dimensional distance function F which 
vanishes only at o the relationships between the function 

^=^(A)=sup inf F(ac) (1) 

Xo x=Xo(A) 

discussed in § 1 and the successive minima Aj , of F with respect 

to A which were discussed in Chapter VIII. 

We first prove the inequality 

= 2/t ^ Aj + • • • + . (2) 

Let f>2 , . . . , be any basis for A. Then by the definition of and the 
fact that F(a;)<1 is bounded, there are vectors c^vA such that 

Hence the vectors dj — b^— 2Cy all satisfy 

F(d,.)^2/r. 

Since the dj are linearly independent, as is easily seen^ by considering 
congruences modulo 2. the left-hand side of (2) follows. 

We now prove the right-hand side of (2). There are linearly in- 
dependent vectors Oy of A such that 

F{a,)=k,. 

Every vector Xf, is thus of the shape 

®0 = fl«l+ +fn«n 
for some real numbers ii, Put 

a =«iai-] 

where 

and are integers. Then, clearly, 

F(x„-a) =F|2(f;-«>)0;} 

g|:|^y-«,|F(ay) 

^iZF{a,) 

=I2A,. 

1 For suppose that o, where the Vj are integers which, without loss 

i 

of generality, may be supposed to have no common factor. Then 2 q by = 2^ rj cy. 

f 

Since the by are a basis, all the ry must be even. A contradiction! 
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This proves the right-hand side of (2). 

On making use of the inequalities 

(3) 

of Theorem V of Chapter VIII, we may deduce estimates for ^ above 
and below in terms of 

^ = infF(a) =F[^). 

a^o 

€A 

From the left-hand sides of (2) and (3), and since 

(4) 

we have 

(5) 

On the other hand, the maximum of Aj-f for given and pro- 

duct Aj... is clearly attained when ki = ?, 2 = - - =A„_i. Hence, by (2) 
and (3), 

Vpkr^2/i-{n-\)k^}^2’‘d{A). ( 6 ) 

Both the inequalities (5) and (6) may be improved. The problem 
of obtaining an estimate above for /j. in terms of is an old one which 
has been attacked by many methods. The latest result due to Kneser 
(1955 a) and Birch (1956a) will be proved as Theorem V. The inequality 
(5) has attracted much less attention. We sketch a proof of an improve- 
ment due to Birch (1956b), as Theorem IV. Birch actually proves 
something slightly stronger than Theorem IV and gives examples to 
show that it cannot be much further improved. 

Theorem IV. 

for convex symmetric n-dimensional distance- functions. 

Birch’s proof is very simple. We may suppose after a suitable 
homogeneous linear transformation that A = Aq is the lattice of points 
with integer co-ordinates, and that 

F(0,...,0,1) =Ai. 

Let ^ be the (w — 1) -dimensional projection of the set 

11 . 9 ’: F(ac)^/i 

on to the hyperplane x„ = 0. Then every point with x„ = 0 is congruent 
modulo A() to a point of so y has (m — 1) -dimensional volume 
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Further, contains the points 

± (0, ...,0,/t/Ai). 



Some elementary geometry^ now shows that the volume oi must 
be at least 






> 



.JL.V (T\> 

A, = 



Since Vifi^) =//'Vp, and since we have assumed that A=Aq, so 
d(A) =i, the truth of the theorem follows. 



Theorem V. Let 



Q 



2”d(h) 

X'iVp 



where q is an integer and Then 



( 7 ) 



+ ( 8 ) 

Further 

( 9 ) 

provided that Q~^n. 

Note that qFx^l’'^Q and q^\ by (3). The inequality (8) is 
Kneser’s (1955 a) ^ and (9) is Birch’s, though the remark that (9) holds 
already for is Kneser’s [see Birch (1956a)]. Birch proves similar 
results involving other minima ■■■, A„_i. 

Before proceeding to the proof we note that (9) cannot be further 
improved®. Let 

F{x) =max{|%i|,...,|:r„|}, 
and let A be the lattice of points 

(Ml. Qu„), 



where Q is any number ^ 1 and run through all integers. 

Clearly 

Ai = l Vp = 2"-, 



and so Q is in fact the number given by (7). Further, /ji=\Q, as is seen 
by considering 

1 The details are given in the author's tract [Cassels (1957a)] page 84 Lemma 1. 
The easiest way is to replace y: F(x)< 1 by a body of the same volume symmetric 
in ;r„=0, on replacing for each (a-j, . . the segment of such that 
(rr, , .....rjey by the one of equal length symmetric in Jf„=0 (Steiner sym- 
metrization). The result is trivial for the symmetrized set. 

® Professor Kneser tells me that he can show that < can be substituted 
for ^ in (8) except when Q is an integer. 

’ Bambah (1958a) shows that (8) and (9) may sometimes be improved if d(F) 
is known. 
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It was long conjectured that (9) was valid for all Q, but the following 
example, due to Kneser and Birch (see Birch 1956 a), shows that in 
fact the weaker inequality (8) cannot be improved for 1^(?< 2. Let 

F(®) =max{|xj|,...,|x„|}, 

and let A be the lattice of points 

(Mj— £M2, Mj— £M3,..., U„_i — SU„, «„ + £%) 

where 0^£<1 is fixed and u^, runs through all integers (note 

the change of sign in the last co-ordinate). Then 

d (A) =!-}-£”, Aj = • • • = = 1 , (1 -)- £) , 



as is readily verified. No case appears to be known when (9) is false 
and Q^2. 

Now to the proof of Theorem V. We work in the quotient space 
^/A and use the notation of Chapter VII and of Theorem IV of Chap- 
ter VIII. In particular, we denote by S {t) the set of points 1) of 31 j A 
which have representatives y in such that F[y)<t. By Theorem IV 
of Chapter VIII the measure m{S(^)} satisfies 



m{S(<)} 



if 

if 



( 10 ) 

( 11 ) 



We shall also need the inequality 

m {S (<i + < 2 )} ^ min [m (S (q)} -f m {S (t^), d{A)], (12) 

for any <i^0, <2 = 0- This follows at once from the “Sum Theorem”, 
Theorem I of Chapter VII. Indeed, S (<i -f contains the sum S (fj) + S {Q , 
where addition of sets is as defined in § 3 of Chapter VII, since 
F[yi + y^<h + h if P{yi)<h and F{yi)<t^. 

We also remark that fj, is the lower bound of the numbers t such 
that m {S (f)} = (^ ( A) . Clearly m{S(<)}=(f(A) if every point of 3i is 
congruent modulo A to a point x with F{x)<t. Conversely, suppose 
that m{S(fo)}=(i(A). Let £>0 be arbitrarily small. Then m {S(£)}>0 
by (10), and so every point of ^/A belongs to S(fo) + S(£) CS(fo+e) by 
the first part of the “Sum Theorem” I of Chapter VII. 

We now prove (8) very simply. By (10) we have 

m{S(iA,)} = (|AO"Fp = e-'<f(A) 



m {S (ix‘/" Ai)} = X (iAi)" Vp = KQ-^d[A). 



and 
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Hence, b}? repeated use of (12), we have 

m [5 {Pi {q + ^qm (S(P,)} + m (S A,)} 

= {q + x) Q-^d{A) 

= d{A), 

as required. 

To prove (9) we need (11) as well as (10), where now 

Q^n. 



We must distinguish two cases. Suppose first that 






Then by (2), which proves (9) in this case. Otherwise, by (11), 



m 




> 






— d{A)ln, 



by the definition (7) of Q. Hence, by repeated use of (12), we have 



m{S(igAi)}^tf(A), 



which completes the proof of (9). 



XI. 3.2. We are now in a position to prove the result enunciated 
in § 1.3 that when the star-body F[x)<‘\ is bounded, then b(f) is an 
attained minimum, that is, in the notation of §1.3, there exists a lattice 
M such that 



<i(M) 



b(F) = inf 



{/^(A)r 

d{A) ■ 



We must use the transference theorem of §3.1 to ensure that we 
may apply Mahler’s compactness criterion. Write 



Fo(a;) = 1*1, 

so that 

F{x)>cF^{x), .00 



for some c and all x, since T'(ae)<l is bounded. Hence clearly 

^«»(A)^c"V(A), 

where the superfix indicates that the quantity is relative to i^. In 
particular, if n (A) is bounded above for some set 2 of lattices A, then 
so is (A) ; and hence Af’ is bounded below a strictly positive number 
by Theorem IV [or by the weaker inequality (5) of §3.1]; that is 

I A| = inf I a| 

' ' oca' ' 

+ 0 



is bounded below. 
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Now we select a sequence of lattices A, (1^r<oo) not necessarily 
distinct such that 

and 

{;r(A,)r->b(F). 

From what was proved in the last paragraph, |A,| is bounded below 
by a positive number. Hence by Mahler’s compactness criterion there 
is a convergent subsequence; without loss of generality 

A,^M. 



Then M clearly has the properties required. 

XI. 3 . 3 . Let A and A* be polar lattices in the sense of Chapter I, 
§5. It was there shown that a necessary and sufficient condition that 
a point X belong to A is that the scalar product xa* be an integer for 
all a*eA*. We develop now what may be regarded as a quantitative 
generalization of this statement. For a real number | we denote by 
Ilf II the difference between f and the nearest integer either above or 
below taken positively, that is 



ll^ll 



= inf 

m=0, ± 1 , ± 2 , 



I f — w| . 



There will be no possibility of confusion with the notation ||t|| where t 
is a homogeneous linear transformation. 

Theorem VI. Let F(x) be a symmetric convex n-dimensional distance 
function corresponding to a hounded set F[x)<\ and let F*{x) be the polar 
distance- junction. Let A and A* be polar lattices. For any point x^ write 



and 



m{x^ = inf F[x) 
x=ic„(A) 



K[Xq) = sup 

o*6A* 
+ « 



II a*a;oll 

F*[a») ’ 



0 ) 

( 2 ) 



where a*Xf, denotes the scalar product. Then 

1 2*1-1 w (aio) = -^(*0) = ^ (®o) • ( 3 ) 



The precise values of the constants in (3) are immaterial; what 
matters is that the ratio K{Xa)lm {x„) lies between constants. Theorem VI 
goes back in essence to Khintchine (1948 a). Kronecker’s Theorem 
follows from it in a few lines [compare Chapter V, § 8 of the author’s 
tract (Cassels 1957a), where a less general form of Theorem VI is 
given]. 
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VVe first prove the right-hand side of (3). Let 

®i = (A) . ( 4 ) 

Then Xya* differs from Xf^a* by the integer [x^ — x^a* and so 

||£Coa*|| = ||®ia*|l^|xia*|. ( 5 ) 

But now, by the definition of a polar function (Theorem III of Chap- 



ter IV), 


and since F{x) is symmetric, we have 






1 jCj a* 1 ^ F (®i) F* (a*) . 


(6) 


Hence 




||a!oC*l|^F(Xi) F*(a*), 


( 7 ) 


and so 




llajoa*!!^ w(Xo) F*(a*), 


(8) 



on taking the infimum of the right-hand side of (7) over all x^=Xq{N). 
This is just the right-hand side of (3). 

To prove the left-hand side of (3) we need the dual bases 5 y and hf 
of Theorem VII, Corollary of Chapter VIII, for which 

F(b,.)F*(bf)^(ir !(«!)* (9) 

Let be any point, so that 

= ^1 H + In 

for some real numbers fy. Then, by (2), 

111,11 = l|b>„||^F(af„)F*(b*) (10) 

for Choose integers «y so that 

I“f-Ifl=lllfl|. (ff) 

and let 

= (li - Ml) H h (In - M„) b„, 

SO 

= x„ (A) . 

Then by ( 9 ), (10) and (11), 

m{x„)^F(Xi) 

=zm\m 

1 

^F(»o)2F*(b*)F(by) 

i 

= -^{n\rK{x,), 

which is the left-hand side of (3). 
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XI. 3. 4. In this section we prove a rather specialized transference 
theorem which we shall need in § 4- The proof uses the so-called tech- 
nique of the additional variable which has often been used with success^. 
For example, the best result in the direction of Theorem V until the 
work of Kneser was proved by Hlawka (1952a) using this technique. 
[It is reproduced in the author’s tract (Cassels 1957a) in a special case.] 



Lemma 1 . Let F^{x) = | ® ] , where x fs a 3-dimensional 

vector. Let ^ be the successive minima of a lattice A with respect 

to Ff, and let 

f .1 = sup inf (x) . 

Xa X2£Xo(A) 

Then 



and 



^ _ ! K hh '? 

X\- \2d(h)} 



( 1 ) 



( 2 ) 

We first prove (2). There are linearly independent points Oy of A 
such that \a^\=Xj. Let Cj , Cj , Cg be a set of mutually orthogonal 
vectors such that 



ai = Ci 

flg = ^21 “F ^2 

~ ^3 1 ^1 T 2 "h 



( 3 ) 



for real numbers y . Then 



\c^\^^\a^\^ = Xj (1^?^3). (4) 

But now, if Xq is any point, it is possible to choose integers Mg, «2> 
successively in that order, so that 



a?! = *0 + MlOi -f MjOg -f MgCg = Cj -f Cg + |g Cg , 



where the numbers fy satisfy 
Hence 

by (4). This establishes (2). 

We now construct a 4-dimensional lattice M as follows. There is a 
point Xf, such that 

A=a(A)= inf >(. (5) 

xsxo (A) 



^ Apparently first used by Mordell (1937a). 
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Let the number q be defined by 

+ = ( 6 ) 

SO 

G = 2 ( 7 ) 

by (2). Then M is the set of all 4 -dimensional points 

X=ix,Qu), (8) 



in an obvious notation, where u runs through all integers and the vector 
X satisfies the congruence 

x = uXg(A). ( 9 ) 



Clearly 



d{M)=Qd(A). 

If Xe M and M =)= 0 we have 



by ( 5 ) and (6) or by ( 7 ) according as m = T;1 or |«|>t. The values 
taken by |X| with «=0 and XgM are precisely those taken by |a;| 
with x&A. Hence the four successive minima of the function |^| with 
respect to M are where Xi, X2, X3 are the minima of |a;| 

with respect to A, as already defined, and 



X,^X2. 



(Indeed ^4 = ^3, but we do not need that.) 

By Theorem I of Chapter VIII and Theorem IV, Corollary of 
Chapter X, we have 



^1 ^2 X3 ^ Xi X2 Xg x^ 
= 2«f(M) 



(to) 



= 2Qd{A), 



where j is the lattice-constant of the 4 -dimensional sphere | X | < 1 . On 
eliminating q between (6) and (10), we obtain the required inequality (1). 
We shall actually need Lemma 1 in the following shape: 

Corollary. To every point Xq there is a point Xi = Xq (A) such that 





(11) 


In the first place. 




3 e -f = « -)- e -f e -|- e”® ^ 4 


(12) 



for every number e>0 by the inequality of the arithmetic and geometric 
mean. Hence it follows from (1) that is at most equal to the 

Cassels, Geometry of Numbers 21 
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righthand side of (II), on using (12) with 




But now, since |aj|<1 is bounded, there is certainly an such that 

lacil = inf lacl g «; 

and the corollary follows. 

XI.4. Product of n linear forms. Let 

F^(x) = \x^...x„\^l''. (1) 

As in § 1 .4 we put 

i«i(A)=sup inf -Fi(®), ®j = supi^^|l^. (2) 

Xo xsXo(A) A 

There is a famous conjecture of Minkowski that 

= 2 -”. ( 3 ) 

That follows at once by considering the case when A=A(, 

in (2) is the lattice of points with integer co-ordinates and ajj = (-| , . . . , ^) . 
Clearly then itj(aCo)^-| for all x = x^{\), and (i(Ao)=1. 

It is well known that 

if A is a sublattice of the integer lattice Ag. The proof is simple. The 
lattice A has a basis 

where the 6,y are integers and 6yy=)=0, ^><, =0 for i>j. For any real 
numbers [x^g, ..., x,^^ we can thus choose integers Mj, ..., m„, in order, 
so that 

I My +M„ iy„ + «yo| ^i|*yy|- 

For ®i=MibiH |-M„6„-fa:o, we then have 

{F(«,)}“^{i|^il}...{il6„„|} = 2-”i(A), 

as required. 

The conjecture (3) has been proved only for «=2, 3,4. A great 
many proofs of (3) for « = 2 for have been given; we shall present one in 
§ 4.2 due to Sawyer. This has the advantage that it gives naturally a 
result for the "asymmetric” distance function^ 

P if x^Xg-^0\ 

\l\XyXg\^ if XiXg^Ol 

I Of course F* ; (®) < f symmetric about o ; but it is not symmetric in the 

four quadrants. 
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where k and I are positive numbers. These arise quite naturally even in 
originally symmetric problems; indeed the result we shall prove was 
first obtained by Davenport (1948a) as a tool in his work on the ^‘sym- 
metric” problem for indefinite ternary quadratic forms. Further results 
about i have been obtained, notably by Blaney (1950 a), Barnes 
and Swinnerton-Dyer (1954a) and, as an adjunct to another investiga- 
tion, by Barnes (1956a). We refer the reader to these papers for further 
details. 

When w = 3, Minkowski’s conjecture (3) was proved by Remak 
(1923 a, b) and a simplified proof was given by Davenport (1939a). 
We give Davenport’s proof in § 4.3, having already paved the way in 
§ 3.4. A proof for « = 3 using different ideas has been given by Birch 
and Swinnerton-Dyer (1956 a). 

When w = 4 a proof of (3) has been given by Dyson (1948 a) following 
the same general line as Remak’s proof. It is an extremely powerful 
piece of work and requires tools from topology as well as from number- 
theory proper. 

For n> 4 only estimates for are known. It was shown by Tsche- 
BOTAREW (1934a) that 

and this was improved by Mordell (1940a) and by Davenport (1946a) 
to 

where s a number <1 such that ?;„^(2e — 1)"^ as w->-oo. Recently 
Woods (1958c) has shown that Tschebotarew’s result may be im- 
proved simply by using Blichfeldt’s theorem instead of Minkowski’s 
convex body theorem. Mordell (1959a) remarks that this improvement 
can be combined with the earlier techniques. In particular, Davenport’s 
ri„ can be replaced by a number which is asymptotically for large rj. 
We give Tschebotarew’s result with its impressively simple proof in 
§4.4. 

Some further results of a general nature are known about this 
problem. Birch and Swinnerton-Dyer (1956a) have shown that 

K(A)}«^2-“(f(A) 

for all lattices A in a certain neighbourhood of the integer lattice Aq, 
and give some other facts relating to the general conjecture. The author 
(Cassels 1952b) has shown that for any £>0 and every n there are 
infinitely many lattices A such that 

{/^i(A)}”^(2-»-£)(f(A) 



21 * 
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and such that no two lattices A, A' of the set are of the type A' — ttaA, 
where t is real and to an automorph of (x ) ; so if Minkowski’s con- 
jecture is true then the first minimum is certainly not isolated. Rogers 
(1954c) has investigated the least number ju[(A) such that for every 
£>0 and every x„ there are infinitely many solutions of 

F^{x) < n[{A) + e x = Xa[A). 

and obtained general conditions for A under which /ii(A) =^^(A). 

Chalk (1947 a, b) has obtained the complete answer for what may 
be regarded as an extreme asymmetric version of Minkowski’s problem. 
He shows namely that for any lattice A and any point x^ there is an 
aci= (xii, ..., x„i) =x^ (A) such that 

^yi>0 {i^j^n), (4) 

Xii...x„i^(i(A). (5) 

That ^ in (5) cannot always be replaced by < is shown by the simple 
example when A=Aq is the lattice of integer vectors and Xq=o. The 
case n = 2 was obtained by Davenport and Heilbronn (1947 a). When 
n = 2, Blaney (1957a) has given an interesting strengthened form: 
namely that for every O!, there is an Xy={x-^^, Xji) =®o (A) such that 

^,i>0 (? = 1,2) 

and 

^ (126^ — 11) d{A) < %iX2i ^d[A), 



where the ^ on the left cannot be replaced by < for a certain lattice 
A. The proof is a classic example of the local methods discussed in 
general terms in § 8 of Chapter X. Cole (1952a) has shown that to 
every cb# there is an x^ = x^ such that 



Xji >0 (t • 1) 

and 

Xii... ^ \d[A) . 



Chalk (1947b) discusses when for given x^ there are infinitely many 
aCi = aej(A) satisfying (4) and (5). The principle behind the proof of 
Chalk’s theorem is similar to Tschebotareff’s, and we prove it in 
§ 4.4. The idea has been put in a much more general form by Macbeath 
(1952a) and C. A. Rogers (1954b), but we do not go into that here. 

XI. 4.2. The proof of Minkowski’s conjecture in 2-dimensions may 
be made to depend on the following lemma due to Delaunay (1947 a). 
He used it as a tool to investigate [ii{A) (in the notation of § 4.1) for 
individual 2-dimensional lattices A; and the so-called "algorithm of the 
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divided cell” has been exploited further by Barnes and Swinnerton- 
Dyer (1954a), and Barnes (1954a, 1956c). It was remarked by 
Delaunay (1947 a) that the lemma does not generalize to 3 or more 
dimensions; and the same counter-example in 3 dimensions was given 
by Birch (1957a) in ignorance of Delaunay’s example. 

Lemma 2. Let A be a 2 -dimensional lattice and let be a -point not 
congruent modulo A to a point on either co-ordinate axis. Then there are 
4 points , a?2 , ®3 , *4 , each congruent to x„ modulo A, where Xj is in the 
j-th quadrant, so that 

®l + ® 4=®2 + ®3 (1) 

and *2—®!, X3—X1 is a basis for A. 

The four points a^i, a?2, *3, forms a “divided cell” of the grid @ 
of points X = a?o (A) . Simpler proofs of Lemma 2 have been given by 
Bambah (1955b) and Redei (1959a). We follow Redei. 

The proof depends on the following two propositions. 

Proposition 1 . Let J/4 be four points of @ such that the 

quadrilateral y^y^yzy^ is convex and contains no other point of @ in its 
interior or boundary. Then 1/1^21/31/4 is a parallelogram and ^3— i/i, 
1/3— 1/1 is a basis for A. 

This follows almost at once from Chapter III, Lemma 6. 

Proposition 2 . Let tt be a line containing points of @ in 3 quadrants. 
Let 1/1 be a point of @ in the remaining quadrant. Suppose that there are 
points 1/2, 1/3 of ® omx such that y^y^, 1/3 are the only points of the closed 
triangle yiy^ys in @. Then Lemma 2 is true. 

For the line tt' through and parallel to it also contains points 
of @ in three quadrants. It is then easy to pick out a divided cell with 
a pair of opposite sides on rr and ir'. 

We now revert to the proof of Lemma 2. We can find 4 points 
«i, «2, *3. 2^4, with Zj in the /-th quadrant, such that the (not necesarily 
convex) closed quadrilateral «i«2®3*4 contains as few points of @ as 
possible. The following three cases are all that can occur. 

(i) The quadrilateral z^ z^ z^ is convex. It is then a split parallelo- 
gram by Proposition 1. 

(ii) Three of the points «i, «2. *3. «4 are coUinear. If, say, z^, Zg, z^ 
are on a line tt, then Lemma 2 follows from Proposition 2 applied to 

and TT. 

(iii) One point, say z^, is an inner point of the convex cover of the 

remaining three. By the minimal defining property of z^, Zg, Zg, z^, 
any point of @ in the closed quadrilateral «2 *4 other thanSj, »4 

must be in the first quadrant; and such points exist since z^ is one. 
We may thus choose a point t of @ in the first quadrant and in the 
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triangle *4 . such that the only points of @ in the closed triangle 

«3 1 are the vertices. Lemma 2 now follows from Proposition 2 on 
putting 

yi = «3> ya = «2. ya=t 

Since we have now disposed of all three cases, this concludes the 
proof of Lemma 2. 

Corollary. If Xj = {x^j,x^j) then 

/ 

For the area of the divided cell is d[t\). It is also the sum of the 
areas of the four triangles with vertices o, Xj , (1 ^ ^ 4 ; *5 = Xj) . 

But now the area of is 

^2,i+i\ + |%,)+i 

and so 

2<^(A) = 2 1%; ^2., -+11 + 2 \Xii ^l,, + l| • 

i i 

The required inequality now follows on applying the inequahty of the 
arithmetic and geometric means to the 8 terms on the right-hand side. 

We can now prove Davenport’s generahzation of Minkowski’s 
conjecture for n=2. 

Theorem VII. Let q, a be positive numbers and 

i6qa^ I . 

Then to every 2-dimensional point x^ and every lattice A there is a point 
x'^Xq (A) such that 

— q d{A) ^ x'ix' 2 ^ a d{A) . (2) 

The case p=(r = ;^ is, of course, Minkowski’s conjecture^ for n=2. 
When Xj is congruent to a point on an axis modulo A, there is nothing 
to prove. Otherwise we show that one of the four points x, 
given by Lemma 2 will do. If not, we should have 

\XiiX2x\> O d[A) , 1^3^231 > (A) . 

i% 2 ^ 22 |>e^(A), |%4^24i >q d{Ay, 

which is in contradiction with Lemma 2, Corollary. 

The reader should not find it difficult to verify that when g =<r = J 
the only case when the equahty signs are needed in (2) is when 
A = < <«) A, and x^ = < u> (-j , |) ( A) , where f > 0, u) is an automorph of % x^ and 



1 Proved by Minkowski in this case. 
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Aq is the lattice of integers. Davenport (1948 a) showed that the equahty 
signs may be needed when g 4 ^a. On the other hand it follows from 
Chalk’s Theorem of § 4.4 that something stronger is certainly true if 
p>l or a>\ \ and Blaney (1950a) has given stronger results which 
cover the cases when p or <r is near 1. 

XI. 4 . 3 . We now give the Remak-Davenport proof of Minkowski’s 
conjecture in 3 dimensions, which depends on the following 

Lemma 3. Let A be any 3 -dimensional lattice. Then there exist 
numbers pj> 0 , (l^/^ 3 ) that there are no points of A other than o 
in the ellipsoid 

S\ pix\ + p.^x\ + p^»^^<\, (1) 

but there are three linearly independent points of A on the boundary of S. 

We call the elhpsoid S free if o is the only point of A in it. We shall 
assume that a free elhpsoid cannot have three hnearly independent 
points of A on the boundary, for some particular lattice A, and will 
ultimately deduce a contradiction. 

We note first that 

A/>2^3^(ff{‘^(A)}-^>0 (2) 

for any free elhpsoid, by Minkowski’s convex body theorem: the 
constant in (2) is not important ; ah that is important is that it is positive. 

Secondly, if ±Oi> i®2. ±<*3 are three hnearly dependent pairs of 
points of A on the boundary of a free elhpsoid, we must have 

i ®i i ®2 i O3 = O 

for some choice of the three ± signs, since the ±0,- he on a plane through 
the origin and so are points of a 2-dimensional lattice on the boundary 
of an eUipse which contains no point of the lattice (Theorem XI of 
Chapter V). 

Thirdly, under our hypothesis, if there are two pairs of points 
rhCi and ±0^ of A on the boundary of a free elhpsoid, they cannot 
both he in the same co-ordinate plane, say, Xj = 0 . For then we should 
have 

^2^*21 ^31 — t . ~ (0) *21> ^ 1 ) 

^2^22 T~ ^’3^32 — 1 > ®2 ~ (0> ^22> ^32) • 

If p^ is decreased but p^, p^ kept constant, the points a^, remain on 
the boundary and the volume of the elhpsoid increases. Ultimately 
there must come a third point on the boundary for some value of p^, 
since it is impossible to decrease to 0 without a point of A entering 
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the ellipsoid, by (2). Hence for some the ellipsoid is free but there 
are points a^,a^,a^ on the boundary, where Oj is not on a;i=0. This 
contradicts the hypothesis whose absurdity we wished to prove. 

Fourthly we show (on our hypothesis) that if there is a free 
elhpsoid (1) with the points ±ai, on the boundary, then there 

is one with ±Oi, and ±(Oi + ®2) on the boundary. For put 
03=0^+02, 04 = 01—02, and write 

«/ = K,-.«2y.«3/) (t^/^4). 

Then 

^ia?, + ^2«2, + ^3«i,j ^ I (3) 

(? = 3.4).J 

There are numbers ^i, ^2> % not all 0 such that 

<h <A.j + ?2 «l; + ?3 «i; = 0 (/ = f . 2) , (4) 

and after a change of sign, if need be, we may suppose without loss of 
generahty that^ 

9l <*14 + ?2 *24 + ?3 ®34 = 0. (5) 

We now consider the ellipsoids 

(Pi + tqi) x\ + (P2 + tqi) 4 + (p3 + tq^) x% = 1 

where 

Since at least one of q\,q^, q-^ is negative by (4), as t increases from 0 
the inequahty (2) with pj + tqj for p^ must fail for some t\ so there 
must be some value of t at which for the first time a lattice point enters 
the eUipse S. This cannot be O4, by (5), and so must be ±®3= ± (®i+®2) 
by the second remark; which concludes the proof of the fourth remark. 

We now prove the lemma. It is clear that we can obtain free ellipsoids 
with two pairs of points ±®i. it®2£A on the boundary by varying the 
parameters py appropriately. By the fourth remark, there is then a free 
eUipse with a^,a^,ai-\-a^on the boundary. Then by the fourth remark 
apphed to Oi and 01+02 there is a free ellipse with 0i,0i + 02 and 
20i + 0j on the boundary. By induction, there is an elhpsoid 

p(»);r? + pW^^ + p<'*>4<f 

with Oi, nOi+Oj, (« + 1)01 + 03 on the boundary. In particular, 

p<“) (m*ii + * 12 )® + ^ 2 "’ (" «21 + * 22 )*“ + Pz^ {» *31 + * 32 )* = f • (6) 

* It is readily verified that there cannot be equality in (5), since the deter- 
minant of the three forms in q^, q^ in (4) and (5) does not vanish. But we do 
not need this. 
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We distinguish three cases. Suppose, first, that fl]^i=j=0, fl2i=t=0, a3i=)=0. 
Then, by (6), 

(/ = 1 , 2 , 3 ) (m-^oo), 

in contradiction to (2). Suppose now that precisely one of a^, a^i, 
vanishes, say, an = 0, a2i4=0, «3x=t=0. Then by the third remark above 
we have fli 2 =f= 0 - ®o 

(? = 2,3), 

again in contradiction to (2). Finally, suppose that two of a^x, a^x 
vanish, say, aji = a 2 i = 0=f=^3i- Then aj2+0d=«22> ^nd so 

again in contradiction with (2). Since we have reached a contradiction 
in every case, we have proved the absurdity of our initial hypothesis 
and so the lemma is true. 

Minkowski’s conjecture for m=3 now follows in a few lines from 
Lemma 3 and Lemma 1 Corollary. 

Theorem VIII. Let A be any 3-dimensional lattice and any point. 
Then there is an aCi=(%i, % 2 i. ^ 3 i)=®o (^) 

1^11 ^21 %i| = 1 ^ (^) • (7) 

Let px.pi, Pz be the numbers given by Lemma 3 , so that A has no 
point in />i3fi+^2*2+^3^3<L but three linearly independent points 
on the boundary. Hence the three successive minima of A with respect 
to the distance-function 

F (x) = (px 4 + p2 4 + Pz x\) ^ (8) 

are 

= ^2 = A 3 = 1 . (9) 

We may now apply Lemma 1 Corollary to the lattice M of points 
{p\xx,p\x^,p\x^). {xx,x^,x^)ah, 

with determinant 

dm = {pip2pz)^m 

and with successive minima with respect to |a!| given by (9). Hence to 
any there is a congruent Xx such that 

x!x + Pzx!i-hpzxlx^} 

The required inequality (7) now follows at once from (10) and from the 
inequality of the arithmetic and geometric means. 
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The reader should have no difficulty in showing that the sign of 
equality in (7) is required only when A=ttaA^ for some number f=f=0, 
and some automorph ta of where A# is the lattice of points with 

integral co-ordinates; and then only for (A). Note 

that to have equality in (7) one must have equahty in both applications 
of the inequahty of the arithmetic and geometric means; that going 
from Lemma 1 to Lemma 1 Corollary and that going from (10) to (7). 

XI. 4.4. We now prove^ the theorems of Tschebotarew and Chalk. 
Since Chalk’s theorem is shghtly simpler, we prove that first. 

Theorem IX. Let A be an n-dimensional lattice and a point. 
Then there is an ..., (A) such that 

[i£j£n), ( 1 ) 

Xii...x„i'^d(A). ( 2 ) 



There certainly is a point ® 2 =(% 2 > ^» 2 ) =®o (^) which 

^/2>0 (1^/^«). (3) 

If Yl Xj 2 ^d{A), then we may put x^=x^. Otherwise we have 

IJ d (A) , (4) 



and so, by Minkowski’s convex body theorem, there is a point a=)=o 
of A such that 






(5) 



By considering 2' a instead of a with a suitably chosen integer r^O, 
we may suppose, further, that 

( 6 ) 



for at least one integer /. Then the two points 
X2±a = x^ = {xt,...,xt) 



are both congruent to x^ and lie in the quadrant A;y>0 
Further, 



i i 



n 






since by (5) and (6) every factor on the right-hand side is ^1, and one 
at least is ^ | . Hence choosing for x^ that one of x'^ for which /J 
is least, we have 



a;,-3>0 (1^7 ^k); (i)^J7^,'2- 

i i 



^ Following Macbeath (1952a), but in our special cases the argument can be 
simplified. 
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If 77 ^;3 ^d{A), then we may put Xi=x^. Otherwise we repeat the 
process with x^ instead of *2 and obtain an x^ with 

X,i>0 (1^/gM); 77^,4^ (1)^77 77*, 2- 

; I i 

And so on. Clearly an Xi is reached in a bounded number of steps, 
with a bound that can be given explicitly in terms of 77 *, 2 - This 
concludes the proof. ’ 

A similar idea gives Tschebotarew’s 

Theorem X. Let A be any n-dimensional lattice, e an arbitrarily 
small number and £Cq a point. Then there is a point *1= (xjj , . . . , x„j) s x^ 
(A), such that 

|*ii--.*„i|^(2-"/^+£)^(A). (7) 

Let t be the number such that 



(2-"'^ + e) r = 1 , 



( 8 ) 



SO 



o<t<2K (9) 

If 77|*/ol ^(2 ”^’®+e) (A), there is nothing to prove, so we may 
i 

suppose that 



77|*fo|>(2-"/^+«)^(A) 



= t-’'d{A). 



( 10 ) 



By Minkowski’s convex body theorem, there is a point a=(=o in A for 
which 



|a,| (1^/^«). 



( 11 ) 



As in the proof of Theorem IX, we may suppose, on taking 2'a with 
suitable integer r^O, that 





I*/I = i^l*/ol 




for some /. Put 


=®o±a. 




so that 


1 


/I* \ 




I 

II 

e 

H 

' 


3T? J 

to/ 


But 


af 

- 1< 1 - ^ 1 L- 

10 

Further, 


^ 1 


by (9) and (11). 





4 

i-f2^t--7-^i -— <2 

“ 4 



( 12 ) 



( 13 ) 
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by (1 2) . Hence, on taking for that one of £C* for which ri\Xj \ is least , 
we have 

n\xi2\^^n\xio\. 

i i 

where 

= max{| f — , |l— 

As in the proof of IX we reach an satisfying (7) after a finite 
number of steps, the number of steps being bounded by a number 
depending only on n, and £. This concludes the proof. 



Appendix 

In this appendix we list the lattice constants of some sets connected 
with quadratic forms and give further references and some additional 
comments. We write 



9’r,s(®) = H 1- - ^r+l As. 

and denote by F, ^ the lattice constant of the set 

kr.sl <"* 

in ^-dimensional space, where 

w = r -f s. 

Results about definite forms are usually given in terms of y„ where 
• The first 8 values are known: 

yi = i, yl = i. ri = 2, y\ = 4, 

yl = 8. y? = 64, y| = 2«. 

The value of is trivial ; the values of y 2 ,ya, yt have been found 
in this book (Chapter II, Theorems II,. HI and Chapter X, Theorem IV, 
Corollary). For references and a hst of the corresponding critical forms 
see Chaundy (1946 a), who gives proofs that y 9 = 2®, yJo = 2^®/3; but 
Chaundy’s proofs contain a lacuna. Presumably his Une of argument 
would lead to incorrect results by « = 12; see Coxeter and Todd (1953 a) 
for a special form in 12 variables. 

For indefinite forms we have 



_ 6 
■'1.1 — 4 

p2 p2 3 

■* 2,1 — -* 1,2 — 4 

p2 9 

-* 2, 2 — 4 

P2 _ p2 __ 7 
^3,1 — -*1,3 — 4 
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due to Hurwitz, Markoff, Oppenheim and Oppenheim respectively, 
the proofs being reproduced in Dickson ( 1930 a). We have proved all 
except the last line in the book (Chapter II, Theorems IV, VII and 
Chapter X, Theorem IV, Corollary). All are isolated. The successive 
minima of |y)j j |<1 are the Markoff Chain (see Chapter 2 , §4). The 
first 11 minima for |932 i|<1 and the first 7 minima for 1 952 , 2 ! <f have 
been given by Venkov (1945 a) and Oppenheim (1934a) respectively. 
It is conjectured that I 9 ?, ,|<1 is of infinite type when r>0, s>0, 
r+s^5, see Davenport (1956a)h 
Let B, j be the lattice constant of 



Then 



0 < 95, , s < 1 • 



B 



2 _ 1 
1.1 4 



B 



2 

2,1 



1 
4 > 



B 



2 _ 4 
1,2 — 27 



BU 



X 

16 ' 



B 



2 _ 3 
3.1 — 16" 



B 



2 _ 27 

1,3 — 266 • 



The value of Sj j is given by Theorem V of Chapter II. The results in 
the second row are due to Davenport ( 1949 a); both are isolated and 
something is known about further minima, see Oppenheim (1953 ^)- 
The results in the third row are due to Oppenheim (1953 b) and again 
something is known about successive minima. In all cases the critical 
lattice has points a 4 = o at which 95, 5 (a) = 0. 

Let A, j be the lattice constant of 



Then 



0g9?,,,<1. 



A 



2 _ 1 
1,1 4 



A 



2 

2, 



3 

1 — 4 > 



A 



2 _ 5 

1.2 — 16 



The value of Aj j follows at once from Theorem VI of Chapter II. The 
rest are due to Barnes (1955a) and Barnes and Oppenheim (1955a). 

If a quadratic form in « ^ 3 variables takes arbitrarily small non-zero 
values of one sign then it also takes arbitrarily small values of the 
other sign. If a quadratic form represents 0, has two of its coefficients 
in an irrational ratio and has « ^ 5 variables, then it takes arbitrarily 
small values of both signs (Oppenheim 1953 c, d). 



^ For later work on this problem, mainly due to Davenport and Birch, see 
Ridout {1958a). 
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